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PREFACE 



. This- l^iandbobk IS a prbdutqt of t^e Civil Service Commission's' 

contii^i^ing program of manpower planning research. Earlier stages/ of ' 
this i^esearcti have been piiblished in the former Federal Workforce' 

Outlook and Current Federal Workforce Data publications series. 

.1 . ^ ' ■ ■ 

j- Tfiis handbook is divided into two main parts, a narrative text and a 
i.serie^ of appendices. The text deal^.wit.h three pri^jcipal topics: 

Staffing needs pl^pniing policy matters 
■' • i (Chaptet 1);' . - ^ " ' 

/.. ' ' ' ■ . ' 

ManUal methods of projecting turnover losses 
(Chapters. 2-5^;, ^pd 

Compute'P^ methods for analyzing turi^Over, advancement and 

hiring needs "(Ch^pt^rs 6-10). / . . ' ' 

■I ' ' 

* ■ ■ 

The appendices provide , detailed documentation on the manual me*thods 
and the computer prograths', .ihcluding full computer program listings, 
ope^ratioh manuals J^hd technical analyses. 

W:j^gt|Ln the text, all statistics :ar^ fj^esented in an^ elementary , step- 
by-^'step manner'; which thoroughly explains and displays all the techniques 
which are utilized^. This method of presentation was- chosien to accom- 
"modate the wide diversity of skill levels of the audiences^ f or which 
the handbook hfis "been written. Thus' the handbook can be. ui^etstQ,'c^fi-^ 
used by po^le with any Tevel of quantitative knowledge:* 1^1*^^ tho^3&.. 
who have ye^v few quant.lt'-^tive skills ta those who are techfiltally . , 
proficient. In addition, the handbook can be used by oirj^art i zatlon^ 
with any type xDf dat^i Rvstom: frommaitual recorHkeepTT^j? r'> r:opT> i c^f i r g 
^*'>niputer svsrrmp. 

These policies, procedures, and computer programs are offered for 
optional apd developmental use onlv. They may be.used in whole or in 
part by any agency or organization which desires to ^do so. Tn no sense, 
rthen, should! theiiv^se be considered mandatory on any Federal or non- 
Federal office or organization. 

• 

Bein^ developmental in nature, we would fe^^pect that these policies and 
programs will tiot necessarily be! uniformly effective under all types of 
field conditions. We .would welcome; information from users as to their 
experience with this, technology and would, w^come information on the 
changes, or suggested changes, which users niay find necessary or desir- 
able under field conditions.^ 



- iii - \ 



can 



Al^Of jifiie Commission stax^lds ready to p'rovZde whatever assistaAce it 
to organizations interested in studying or applying the technique^ (ion- 
tained herein. ' ' j 

yoii have "^any questibns or inf ormation> or would like to discuss | 
poisslble Con^issioti assistax^ in your organization, please address^: 

^ * Bureau of Policies and Standards 
U.S. Civil Service Commission 
. , Washington, D.C. 20415 
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CHAPTER 1 



' STAFFING NEEDS, PLANNING, " . 

\: ' ■ .... ■ ' ■ ■ ■ • i 

•* . . < ■ • 

Introduction " " ; U 

\ ■ ' ' ; . ' 

, / •■■ 

One of the central ta^ks of personnel «anagemenli^ is to help agency , /' * 
management acquire the numbers and kinds of workers vrequired to carry 
out the brganizatio^i's mission. Obviously^ to do thVs» it is first - ; '; 
necessary to identify what the organization's staffing needs are. -.On^ry ' 
then can personnel management actions and programs to meet these tieea^ " ^ 
l^e planned and cardi.ed out. - ' ' ^ i>, , ^ 

» ■ X ■ ' . ' ; . ■ .^'^ 

In the past, organizational staffing needs have been custpmarily deter/r; 
mined by such means as receipts of SP--52's,'^Requ6¥E'f6Y Personnel Actip^ 
These informed personnel officials of the exigence of .vacancies that 
needed to be filled. In some cases, personnel officials wfere given** 
^.advance information by management of the impending establishmept of:n4w 
or additional -positions which would soon have to be filled. In general, - 
r^it was expected that personnel officials would draw upon both types qf 
.data sources for identifying total organizational staff ii^g^ needs. 

Increasingly, however, these traditional methods have proved 'insufficifent. 
As the Federal workforce has continued to evolve 'from a largely clerical 
structure to a largely technical and prof essional' structure, the average 
federal employee skill level — and thus the average qualification require'- 
ments Steadily increased. This has meant a corresponding in- 

.crease in th^lead time necessary to train the average Federal hire up 
to full-performance level. Lead times ofs2-3 years fox; develQjiment of 
lan average f ull--perf ormance worker^ are now the rule, not the exception. 
, This means that in many fields, trainee hiring must Increasingly be 
keyed, not to current vacancies, but to full-performance vacancie's that 
will be coming up .2-3 years from now. . 

Another recent development has ^dded greatly to the need for anticipating 
future /staffing needs: the advent^ of multi-year Federal program planning. 
Since so much of an organization's personnel management program is>G * 
direct . fxmctidn of the drganization's staffing faeeds, ^he effective 
multi-year planning of such personnel programs clearly requires: that 
future changes in staffing needs be identified well. in advance of their 
occurrence — -a tasjc for which the traditional methods of staffing needs 
identification ^re clearly inadequate. ^ 

For a variety of reasons, then, contemporary conditions require that 
Federal, personnel managers develop methbdS;vfor anticipat^ing future 
organizational staffing need s^ and that; th^se methods be made'' a part of 
the ^iersonnel management functio^ at all i^els. 



This anticipation of future staffing needs — which is^called staffing 
needs planning — is one part of the overall process by which an organi- 
zation plans for its manpower. This total process is called organiza- 
tion manpower planning . To get; a proper perspective on the place of 
staffing needs planning in the total system, we must' carefulLy define 
aMe terms which Aave in the past been used imprecisely. 

Definitions and Responsibilities \ > , \ \ 

The term "manpower" combines the element "-power";, or capacity to do work, 
with a collective element indicating the source of that power: 'i.e., 
"map-r", in i^s usual generic sense of "hioman beings." Originally used 
to refer to the aggregate work capabilities of the total labor force of 
a state or nation, "manpower" has come to be applied%to the collective 
Ikbor force of any. identifiable functional-'etitity—afi organization, 
industry, company, etc. Adding to this the ^ term "planning" gives us, 
"manpower planning", a term in common use throughout the English-speaking 
world (and ill literal translation, throughout the WesterAized wotld) fprv. 
"the systematic planning of and for the manpo\)er requirements of a 
specified functional entity." . g 

Over the years, "manpower planning" has been applied to a variety of 
rather different types of activities. One type is oriented toward 
labor force analysis . This includes both (a) the development of'descrip-- 
tiye statistics showing the significant dimensions and components of a 
specified labor force and their change over time* and (b) the conducting 
of analytical and projective studies of the changes in, and the factors 
relating to, the labor force features thus portrayed. This type of 
activity is tiypifi^d by the work of the Bureau of Labor Statistics and 
the Bxire£^u of the Census. " ; 

A second type of act;ivity isi^^riented toward the development, adirrlni'^ 
^.stratlon and evaluation of manpower prograii^s . This includes both (a) 
programs to improve t;be emplpyinent status o.f particular groups or 
segments of the Xabor force (r.p., equal employment oppqr/tunity programs), 
and (b) programs^ to promote the optimi7ation of manpower snpply/deinflind 
relationships In particular industries and/or o^cupat1ona^ areas. 

Neither the analysis-oriented ;nor the program-oriented types of man^^fewer* 
planning, however, have much in common with organization manpower ^ 
planning ; the systematic planning of the manpower needs of individual 
organizations. Organizational manpower planning has' a different * ; 

purpose, uses dif f erent^^ypes of data, and — especflally — uses different , 
specialized methods ^Mr^^hniques than does either of the other major 
types of manpower planning^ ^ 
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^^o^^^^^^^x,^ ui«»MKww«si. yxcuufxu^ xa maae up or two quite dlstiact ♦ 
aspects. ' ' ^ o ^ I 

The first aspect of organization manpower planning is the planning of 
the numbers and kinds of* workers needed to perfo-rm ^he organization's 
work. This is workforce planning .- 

The second is the systematic estimation of the numbers anci kinds of 
..future personnel management actions which will have' to be taken in 
order to provide this required Workforce. This is staffing needs 
planning -~ 

In theory, these two functions interact to form a continuous, coherent 
organization manpower planing i)rocess. In practice, however, these 
two functions operate, in most government organizations, relatively 
independently. 

Workforce planning—deciding what types of workers, and how many, are 
to be present in the organization—is a responsibility of agency manage- 
ment. The manager may have help in performing this function from such 
management staff as "O&M" • (organization and methods), "OR^' C9perations ^ 
research), or budget specialists. And in many agencies management offi- 
cials, as a matter of policy, consult personnel officials on the manpower, 
aspetets of workforce planning. (Cf., pp 8-9, below.) 

Staffing needs planning, on the other hand— planning the future personnel 
managment actions needed to provide the manager's required workforce — is 
a respojlsibility of the agency's personnel director • 

These two functions also differ in, their purpose and methodology. The 
purpose of workforce planning, for example, is to answer the questions: 

" — What kinds of workers *will be needed? 

— At what skill levels? 'And - 

— How many of each? 

The purpose of staffing needs planning, on the other hand, is to answer 
the questions: ^ 

— What types of futute peiaiohnel actions, and how many, will be 
needed to provide mainagement ' s planned workforce? 

r- Will providing the workforce be feasible? (If not, what 
changes will be needed?) And 

— What will providing it cost? 



On the subject of methodology, we would peine out that th^ methodology 
.of workforce planning, in its most typical form, is to start with (a) 
estimates cyf expected workload, to vmich are applied, (b) measures or 
assuppti^oM of output -per unit of labor tiiife, and by this means "to 
dezT^e estimates of (c) the numbers and types of workers needed to 
produce the expected wot'kload. Workforce planning methodblogy, that la, 
uses workload and wot'k measurement data and derives "required workforce" 
plans. 

'I 

The methodology of staffing needs planniu'g, on the other hand, is to 
start with (a)" the manager's "required workforce" plan, estimate (b) the 
personnel losses and shifts likely to take place during the planning 
period, and then to determine (c) what future personnel management 
actions will be needed to provide the required workforce. Thus, 

■ staffing needs planning uses mostly personnel data and produces either 
suggested changes in management's workforce plan or summary estimates 

^of wkat must be done to provide the required vjprkforce. 

It is the purpose ot this handbook to set ^orth in substantially full 
detail (a) the policies and features which characterize effective 
staffing needs planning programs, and (b) a specific bfsic technology 
for performing the key analytical functions of such programs, 

SuK>^ested Policy Requirements ^ 

From the above discussion of the distinction between workforce planning 
and staffing needs planning, ^t is clear that a definitive manpower 
planning policy is needed which will make explicit both (a) how these two 
functions differ, and also (b) in what respects they are interdependent 
and must function in close coordination. 

In general, we suggest that an effective overall policy would provide that 

1. Personnel management officials should recognize the workforce 
planning aspect of organization manpower planning as the 

• direct responsibility of agency management; that 

2. Such personnel officials should provide, information and 
assistance to management and <^management staff in the 
performance of their workforce planning tesponsibilities 
under policies and requirements established by higher 

^ management authority, the^ Office of Management and 
3udget, the President and the Congress; that 

3. Agency management officials and management staff should 

^ recognize the staffing needs planning phase of maivpower > 

planni^^g as the direct responsibility of organizatiqn 
personnel management; and that 




4. Such management officials should provide information and 
^/assistance to personnel officials in their performance of 
the staffing needs planning, responsibilities under ^ 
alleles and requirements established by higher authority, 
the Civil Service Commission, the President and the Congreos. 

In addition to a general policy, guidelines for the basic functious of 
staffing needs planning pr£>grams should be spelled out in concrete > 
detail. We suggest tha'fe specific reference be made to the following; 

1. information Functions - There should be provision for: 

The regular transmission to organisation persvpnnel 
officials of detailed data on the workforce structure 
established or proposed by management for each phase 
of the organization's planning period; and 

The regular transmission to organization management 
officials of the types of personnel management and 
staffing needs planning information detailed under 
"Workforce Planning' Functions/' below. 

2. Analysis Functions - Provision should be made for the '> 
regular,, scheduled analysis of manpower resourjces and 
personnel transactions data. This may 'include such 
specific analyses as: ^ 

Treiids in the ocoupationai , grade or skill level, 
and/or pay distribution of organization workers in ' 
specified occupations, functions and/or organization 
segments; / 

Levels and trends of workforce composition in parti- 
cular occupations or functions ffy such dimensions as 
age> sex, minority status and/or length of service; 

* . * • 

Levels and trends of employee retirement losses and/or 

eligibility; % . 

^ - The pattern and-Zor trend of workforce accession actions 

(number and percent of outside hires at entry levels, .etc .) ;. 

* ^ ■'^ . 

Levels and trfends in occupationai^adVancement patterns, t 
inter-occupational flows, etc; and/or. 
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- Levels and trend^iiof .oetupatlonal and/or organl- 
za!tlona.l loss rates due to quits, transfers, occupa- 
tional shlf!:?^ etc. ^ 

Staffing Needs Estimating Functions - Provision should be 
made for the systematic estimation of current and future 
sta,fflng needs In key organizations, occupation^ or 
specialties. This may Include such specific activities as: 

- Analyzing the net workforce changes which \firlll be 
required during. each phase of the planning period 
under, management* s actual or proposed workforce plans; 

Projecting the numbers and types vacancies that are 
likely to occur during each phase of the planning 
period due to such causes as turnover losses, death, 
disability, retirement, management actions, etc.; 

- Projecting, by and/or with the aid of personnel 
functional specialists (staffing, training, etc.), the 
numbers and kinds of personnel management actions 
which will be necessa^cy under current or proposed 
personnel policies to provide the required workforce 
wtien, where and with the skills needed; and 

The preparation in c^tw^mient, hard-copy form of 
written svmmaarles of the results of such staffing 
, needs estimating activities. 

Workforce Planning Functions - Provision should be made for 
the development and the transmission' to management of 
personnel management and staffing needs* data and analyse^-^ — ^ 
which are needed for the effective performance of workforce 
planning functions. This may include such data and analyses 
as: ' . * ' 

- Detailed assessments of the feasibility of providing 
managemient* s proposed workforce, b^sed on labor market 
limitations, th^ numbers of employees' in the 
training pipeline, etc.; 

- .Detailed analyses of the means necessary to staff the 

•proposed worl^orce (extent of outside hiring at 
entry level necessary, aiftount of employee training or 
re-training needed, etc.); 



Estimates of the direct and indirect costs of the 
necessary personnel staffing actions (coat of 
recruitment, training, employee relocation, sepa- 
ration and leave payments, etc;); 

-Information on the impact on workforce post/estimates 
of new employee salary or pay schedules, /f6b or 
occupation reclassification actions, employee grade 
distributions, etc.; " * ^ 

- Analyses of the impact of the proposed program of 
staffing actions on (a) the existing workfor.ee 
(advancement or retention opportunities for women, 
minorities, handicapped, etc.)_and on (b) the 
organization's responsibilities for implement in4. 
public policies (EEO, upward, mobility, olde'r ^ 
workers, veterans, etc.); and 

Recommendations for changes in (a) manag^ent's * 
workforce plans (occupational or skill-levef" trade-- 
offs, etc;) and/or in (b) organization pers^imel 
policies or practices (policy on outside hiring, 
etc.), based on the above. 

Personnel Program Planning Functions - Provision should be 
made for the utilization of the above-described data and 
analyses in the establishment pof operational goals and 
objectives for the various personnel management functional 
activities and in the development of personnel management 
budget estimates. 

J ■ 

Data System Functions - Provision should be made for an 
effective data system for obtaining, recording and 
furnishing . the data needed to support staffing needs 
planning functions. Such systems should include specific 
provisions for obtaining and recording needed data on the 
workforce plans and prToposals made by organization manage- 
ment as part of the workforce planning and budgeting - 
process. « . 

Evaluation Functibns - Staffing needs planning systems 
should make specific provisions for the regular and systematic 
evaluation of the above-described manpower planning policies, 
procedures and products, and of their contribution to the 
overall planning, budgeting and personnel management?^ systems 
of the organization. - 
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Plan for This Handbook 



As will be appreciated, addieoslng all of th^jjj^bove tuncLlOafa lu^oue 
single basic handbook Is not feasible. AccO-rdingly , this hanJbook 
focUsses j^n what Is perhaps the pivotal technlcalslunction pf staffing 
needs planning: the- estimation of future Vacancies and, in particular^ 
on the analysis and projection of employee turnover. 

For more than seventy years American researchers h^ivc aoui^lit in vain 
for a simple, reliable method for analyzing and projecting turnover. 
Scores of methods have been trie*d« But each has failed when tried iu 
new situations — or even" in the same situation after a lapse of time. 
Now, however, effective methods are available. 

The luethods set forth la this handbook had their origins In Laolc 
research done at the. VI -S. Civil Service Coimuissiun during tiife peri^a 
1966-70. y Thi s research has been subsequently confirmed as being 
consistei^t with long-term actuarial studies in Great Britain and ou tiic 
Continent and the methods can be considered proven effective and 
reliable. 

Needless to say, the author & ot this iiandbook ac<^epc Lull i c^ouci i L i 1 1 
for the technical ef tect^-ven^sss and adequacy of the speclilc methods 
described. Any errors or weaknesses are ours alone. 

In preparing the various sections of this handbook, we have tried to 
follow a four-^t^p procedyre. First, to describe brletly the specific 
nature of the prbblem at hand. Second, to describe more or less in 
detail the tec]inology which applies to the problem. Third, to provide 
a detailed, step-by-step method for making the necessary calculations 
or manipulations. And fourth, to describe the features and capabil- 
ities of' the computer program^ provided to perform each analytical step, 

A number of considerations ^were involved In the design of the computer 
programs. From , the outset, we determined that they should be; 

1. General in application — usable by ahyone in any organi- 
zation; . 

2. Complete and beli;~standing — all calculations, including 
statistical te^ts, to be done by the program; no statistical 
background required to operate them with full effectiveness; 



i./ Reported to a ^September 1971 NATO Conference in: H. L. Clark, 
"Problems dnd Progress in Civil Service Manpower Planning in the 
United Stateg^" reprinted in Manpower Planning Models » Clough, 
Lewis, and Oliver, Eds., English Universities Press, London, 1974. 
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3. Tephnlcally compact and Simple — programs are written In 
Fortran IV, one of the most AjCOmmon computer languages, and 
/ use only elementary commands and a minimum of core; any time- 
sharing service can rt^ them; and 

4 

^* Fully-documented — For each program^ we provide: 

a) -Program Listing— -a complete listing of the entire 

program and all required subroutines. 

'I * 

b) Operation Manual" — a complete set of step-by-step 
instructions for the program operator. 

c) Technical Analysis—a detailed analysis of what the 
program is doing and how results are obtained. 

Taken together, we believe that this handbook will give both the 
newcomer and the experienced worker a sound grounding in the central 
analytical tasks of staffing needs planning. Prior training in 
statistics is not assumed or required — though, nee'^dless td say^ it will 
be helpful if the reader has it. Some aptitude for quantitative work, 
as well as some elementary algebra, are required. 

For all, we hope this handbook will open new doors to solutions to old 
problems. tf it does, we shall; feel amply repaid. 
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CHAPTER 2 
ANALYZING WORKGROUP LOSSES 



The Technfcal Problem 



Thre pivotal technical problem in the estimAtion of future vacancies is 
the analysis and projection of turnover losses. This can be clearly 
seen from Figure 2-1. 

Notice that the group of positions labelled '^Growth" can be readily 
determined by simple subtraction of the starting population (left bar) 
,from the projection-period's ending population (right bar). What the 
level of "Losses" will be, however, must be estimated by some other 
means. 

Means for estimating some kinds of losses are readily available. Losses 
from death, disability, and retirement are termed actuarial-type losses^ 
because actuarial tables for estimating such losses have been available 
for many years. These are simple "annual loss probability" tables based 
on the employee's age and sex. Samples of such tables will be provided 
later. Here, it is only necessary to note that tables for making such 
estimates are readily available. 

There are do actuarial tables available, however, for other types of 
personnel :^sses* (voluntary quits, etc.). A variety of tecimiques have 
been tried Wer the years to deal with such losses. But it has only 
been recentlV with the development of automated personnel data files, 
that data have been made available to permit effective- large-scale 
research to be conaucted. 

The U.S. Civil Service. Commission has been actively engaged in manpower 
planning research, and particularly in the study of turnover, since the 
establishijient of the Federal Personnel Statistics File (The "10% Sample") 
in 1962. This File contained a continuous work history sample of all 
Federal employees whose Social Security number ended in "5." From this** 
initial this" sample has been expanded to its present 100%. In both 

*its 10% and 100% forms, this sample has made it possible to study turn- 
over in a degree o'f detail never before possible. 

It had long been known, of course, that tutnover rates differed substan-r 
tially by. occupation. With the advent of a computerized data base, 
however, we have been able to study turnover in ^particular occupations 
in great detail. And this research has taught us a great deal. ^ 

Before we take up what has^ been learned from this research, one prelimin- 
ary point should -be clearly understood, ^n this discussion— and indeed 
throughout all pactions of^ this handbook~we are dealing with retention 
and turnover of non-temporary employees only . Losses among temporary 
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employees, such as separations at the end of the job's planned duration 
(e.g., termination at the end, of an appointment "not to exceed 90 days") 
are not dealt with in "-this pamphlet. 

■ > 

With it clearly in mind that A^e are talking about continuing employees 
only, th^, let us see some of the things tj^at havfe been learned from 
turnover research. ' » , 

The Results of Turnover Research 

_ ^ 

Just about the first things you learn when you begin to otady occupy" 
tional turnover 1./ closely are that turnovei: is a function of employees' 
lengths of seirvice;, and that most turnover occurs in the period iiranedi- 
ately after hir^e. Additionally, you learn that the turnover curve is a 
very peculiar kind of animal indeed. 

For example y some years ago we took a ^^roup of low- level clerical lilres 
and followed them over a three-year span, counting at the end of each 
year how "many were left of the original starting group. The figures we 
got looked something like this: 

Start 100% 
^ End of 1st year 58% 

{ ' En^^lj^and year 44% 

End or 3rd year - 36% ^ 

Wlicu graphed, these points look like this: 
100% ^ 



507c 




l./The occupational analyses described in this handbook do not need to be 
; done for every occupation within an organization. They may be restricted 
to an organization's major occupationj^. Statistically, the techniques 
are more effective for occupations (or groups of hires) with 32 or more 
^ employees. . By use of the techniques in Chapter 8, it is possible tc 
combine smaller occupations into larger groups if a user so desires. 
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Now, we know^rom'iong actuarial experience based on our Civil Service 
Retirement Fund records that ^about 5% of such hires remain in the ser- 
vice/all the way to retirement age. Thus a full 30-year curve for this 
''-—group must look something like this: 



1007,s 



507,-. 



10 



20 



'20 



That's a rather striking looklhg curvefK^sn't it? 

Let us look at some of its/properties/ ./klrst, we see that 64% (100% - ' 
36%) of the ^^al of 95% who will leave' before retirement are 'already 
gone by the ^ of the third year after /original hire. And of these, 
42%— or more than half of the 3-year totSl losses of 6Af%~left, in the 
first year alone. ^. /v^^ - 

When we first started seein|v/i^hrves like this, we made up a simple "rule 
. of thumb" to describe this concentration of losses in the early years. 
This "rule of thumb" went something like this: 

"In any given group of hires, from two-thirds to three- fourths of 
all of the quits that will ever occur prior to retirement age will 
have already occurred by the end of the first, three yea^ of ser- 
vice. And of these, more than half will have ocdurired by the end of 
the first year alone." 

We found this rule of thumb us^^ in getting a(^B^ to people' what a 
high proportion , of quits occu't^in^he first few^«s of service.. And ' 
it was useful in showing that the data needed to measure turnover were 
retention rates aver tlme—i.e*'. , longitudinal data. But' it really was 
not much help in, a computational sense. 

The second major thing we can see Srojn this type of data is that the 
turnover curve is not the simple kind of curve' people have thought it 
was. For example. In this sample we see that 42 out of every 100 in our 
.starting group left in the first'^year, another 14 lef t °in the second - 
year, and 8 more left in the thiijli year. 'obvioS^y, then, we r.annot 
just say that any one fixed number of people are leaving our group 
during each year of service. 
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Well, how about a fixed percentage of people le^v: 
of the^eople on board at the^&ptrt of the yea ' 
year"? This Is;^ t;he kind of..--rid!ng ''most people 
take the overall annual Ij/Slsis rate o,f ,the workf 

"Ihe turnover .rate/' ^^his .1^ ai^^o the appr^««i usea m somei kinds of- 
..statistical modelling techniques, utilizing trariati^on mattlc^^^^r "Markov 
Chain" methods. ' K 



be lost during the 
\bf when they simply, 
s an adequate measure 
used in sqm^ fciiids of- ^<^ 



The reten|:io^ data show that this approach will not Ti^otfc eitfeetV'' ' In tlie ' 
first year, ^of ea^ample, we. lose 42 ,out 100 or^AlZ. \ln the second 
year, 14- btit of^58 br^24:?. - And the ' third^-yfear^ 8 out of 44 or 18^. 

J ■ ' V-.'j/^ • * 

What we can see from these data, then, is that (a) turnover is by f ar^ a't 
its heaviesj: in'the.first three y^ars afte/ hire—and especially in the 
first year —but (b) the shape of the turnover curve is by no means^the 
simple type^ of thing most people think it is. 

The Log-Probability Nomograph 

To show you just what the turnover curve really is, w€ 
is called the "log-probability nomograph" (Figure 
for using it are written right on it. 



must look at what 
The instructions 



First, you obtain data on what percent of a given group of hires 
are still on board one year after the date of original hire and 
locate this point on the "1st year" percentage scale. j 

Second, you take the percent the original group that a(e left 
after the second year and locate this point on the "2nd year" per- 
centage scale. 



Then you take a straightedge and draw a line through these points 
and on through the retaining years' scales. Where this line hits 
the scales for the ^?f«T:hrough 7th years indicates the petcent^ge 
of the original starting group that are likely to be left 'after 
each year. 

As you see, a sample line har& been drawn on the nomograph using the data 
points that we gave earlier!. From this sample line, we can now extend 
the number of data points \S/e> have up to seven: 



Year 



% Lfefi 



Ist 
2nd 
3rd 
4 th 
5th 
6th 
7th 



58 
44 
36 
30 
26 
23 
21 
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Having seen one turnover curve for one group of employees, let , us plot 
the curve of another group. This time we take a group of professional 
hires and we find that in this group some 83% are. left at the end of the 
first year and 75% are left at the end of the second year. Plotting 
these two points on the first two scales and drawing our straight line, 
we read from the remaining scales (Figure 2-3): 



Year 

3rd 
4th 
5th 
6th 
7th 



% Left 



70 
66 
62 
59 
57 



^rom these two examples It is clear that the log-probability nomograph 
can be used to plot the long-term retention curve for any cohort (1 e 
starting group) 'for which we have longitudinal data (i.e.;, measurements 

^over t Ime) . o- ' 

Suppose, however, that we have mbre than just 2 years' data— say, three 
or four years—and that when we plot" them v^e. if ind that they are not 
quite in a perfect straight line. (Since there is almost always soke 
random variation around any norm, this can be expected to be the usual 
case;) When this occurs, nil of ^hp'^avMT«bIe points should be used in 
drawing the final straight llnp. ■• 



Suppose for examrle, th-t -e have four points Hke rhis (the^vertic"! 
separation of rhp 



In this case, we simply reduce the number4.£ points to two by connecting' 
the first two polnti^together and the last two points together with 
straight lines and locating the midpoint of each line. The final llnV 
is then drawn • through -these two midpoints thus: 



-,19 - 



27 




If there- are only three points, the solutidn is not quite as satisfac- 
tory, but will give a reasonably good approximation of the correct line. 
For three pointy, we take' a pencil or other suitable marker and draw one 
small circle arWd the first and last points and two circles around the 
middle point, like this: ' 




Ideally, you should then be able to draw the final line tangent to the 
circles around all three points. If they are still too separated for . 
that/ however, the final line should be so d^awn that the total distano 
from the line to the circles around the first and third points combined 
is as nearly as possible equal to the distance between the line and the 
outer circle of the middle point alone, thus: 




^4"!* there .really is no simple method other than 
to say .that the sum of the distances from the line to points above the 
line should be as nearly as possible equal to the sum of the distances 
from the line to the points below the line. Obviously, the nomograph ' 
method is not well suited to large numbers of data points. 

Application to Hiring Levels 

Let us return to the data table we developed using the first set of 
retention data. There are a great many useful points that this table"^ 
can illustrate. 

First, let us apply this turnover curve to the case of an organization 
that has been hiring exactly 100 employees In this occupation each year 
for the last 6 year%. A length-of-service distribution of the employees 
Still on board would look like this: 

Years Since Hire Niimber 

6 23 
5 26 
4 ' 30 

3 36 
2 . 44 



Tot-aT 



58 



217 



As yoM see, this looks iust like our orig<nal retention table axiA if 
graph fhese values, the result would look just like our turnover curve, 
bo the first thing we cap learn from studying the turnover curve is 
that--othor things being »qual, of courae—thp length of-sorvlc- distrl 
butlon of an ocrupatlon's workFrr^o „m t^^^ , ^ oo.c»ir« 
tlon p trurnover curvf»» 



\ 



In actual practice, of course, most organizations don't tend to hiro the 
same number of people every^^ear. Rather, you^hire. more 'some years and 
less In others. But though tWe ups and downs in hirlnglcause corres- 
ponding ups and downs in the leWh-pf-service distributioL, the turnover- 
type curve can usually be found IS you know how to look for it. 

For example, one organization's lekgt|-<^^service distribution for 
emp,loyees who had completed at leAst ion# yea^ of service looked Hke 
this: \ ' ' ' ■ 

* 

Years Niimbeg^ g^ Years Niimbe r 

..• ' 1' 92 ,6 70 

2 77 7 35 

3 55 . 8 34 
^ A 67 9 48 

5 95 
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(We will use only a few years of data to keep this simple,) 



Obviously, when we "^ot these pointa they do not loot very smooth: 
100, 




1. 2 



The ups and downs can be smoothed out, however, by using a quadratic 
centered-moving-average technique. To do this, you choose an. odd number 
of points—say, three or five— and taking the middle point as the centei;^ 
find the quadratic mean of the center point plus either one or two 
points on either side. In this case, for example, we use a, 5-point 
centered-moving-average and we compute the first points like this: 



CMA for year 
3 
h 



is equal to 

^ 92 X 77 X 55 X 67 X 95 

^^7 7 T ifiT x X 70 



This 1*» most eaBlTy done on a calenlator-. 

Step 1. 92 X 77 X 55 X 67 X 95 ■= 2.4799^ x lo" 

Step 2. log (2.47993 x 10^) - 9.39AA/, 

Step 3. 9.39444/5 = 1.878888 

Step 4. antilog 1.878888 =75.7 

And so on. By this means we find that iking a 5-polnt quadratic cm. a; 



for these points gives: 

./ Center Pt. 



3 

* 4 
5 
6 
7 



\ 

CM. A. 

75.7 
71.6 
61.2 
55.6 
52.0 
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PlottiSig these points on our original*" £raph gives: 




As you can see, the smoothed data points look much more like the turn- 
over-type curve we were looking for. As you can also see, however, the 
tendency In this organization Is for new hiring to go sharply up and 
down In quite regular cycles over a multi-year period. Many organiza- 
tions show such cycles and their Identification, thrpugh study of length- 
<rf-servlce distributions, can often be lielpful In projecting future 
hiring ups and downs. 

Let us return, however,, to our hypothetical length-of -service table and 
see what we can say about the level of turnover we can expect to occur 
In this group In the next year. Using our turnover cuxrve, we estimate 
the following losses: 





Y^ars 
Since Hire 


Number. 


Number After 
1 Year 


Lost 
"In Year 






23 


21 


2 




5 


26 


23 


3 




4o 


30 


26> 


4 




3 


36 


30 


6 




2 . 


44 


36 


8 




t 


58 , 


44 


14 






217 


180 


37 



Thus, we estimate turnover over the next year as 37 out of 217 or 17.1%. 

Now, however, there Is a very Important point we dan show. Let iis 
Illustrate It this way. Suppose we decide that we are going to go out 
and hire another 100 people now*.* What effect will "this -have on next 
year's turnover? Let us add these new hires to our table as follows: 
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Years 
Since Hire 



Start 
of Year 



6-1 217 
0 100 
Totals 317 



No. After 
1 Year , 

180 
58 
238 



Lost 
in Year 

37 
42 
79 



As you see, our turnover estimate for next year is now 79 out of 317 or 
24.9%. 

Suppose we tried other levels of new hires such as: 



Number of 
Hires 

0 

50 . 
100 
200 



Number 



r 



37 
58 
79 
121 



-Expected Losses- 

OutNpf^ ; 

217 
267 
317 
417 



Percent 

17.1 
21.7 
24.9 
29,0 \ 



The important point these data illustrate — and it is a very important \^ 
point indeed— is that the turnover rate of any given group is directly 
dependent on the rate of new hires being added to the group . If new 
hiring goes up or down, so does the group's overall turnover rate. 

One reason this point is important' is that it gives us ,an explanation 
for something that labor economists have often observed: Tlie tendency, 
for government's overall turnover rate to go up when the economy rises 
and down when the economy declines. 

What happens is not that there is any change in any existing group's 
length-of-service loss curve — on the contrary^ these curves tend to hold 
their steady course no matter what the state of the economy* Rather, 
what happens is that what goes up in good times and down in bad times Is 
the overall level of govemjroent hiring. 

When you hire more, you have more new recruits in your workforce and, 
since these are lost at much higher rates than your longer-service 
workers, your overall loss rate goes up. When you hire fewer new 
people, the process is exactly the. reverse. Fewer new recruits mean 
fewer turnover losses and a lower overall turnover rate. 

A second reason this point is important is that it gives us new insight 
into what causes differences or changes in the turnover rates of parti- 
cular organizations 6r groups. Very often people Interpret changes or 
differences in turnover rate as reflecting poorj morale, outside competi- 
tion, inadequate pay rates, bad management, poor organization practices, 
-etc., when almost invariably the real causes l(ie in hiring practice 
differences or changes. 
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And finally, this point is important l^ecause it shows that the nomograph- 
type of turnover projection can predict an organization's future turnover 
levels even when the number of new hires to be added to. the workforce 
(or, the total size of 'the workforce) represents a major departure from - ' 
the organization's past trend * * 

This is a virtue of very special usefulness. It is one thing, after 
all, to be able to predict future turnover needs when the organization 
is on a stable trend,. with little change from year to year. Any trend- 
projection technique from "irules of thumb" to Markov chains can make 
good projections in thes^ cirtumstances. But it is quite another matter 
to predict turnover when abrupt and unprecedented change from past 
trends is in the works. Yet it is at times of such abrupt change that 
effective planning is most needed. Techniques that can handle such 
change situations, therefore, are very useful indeed. 

'I 

Other Applications » , 

Although we have now seen what is perhaps the most important conclusion 
to he gained from discussing the nomograph, we should not conclude 
without: at. least a brief mention of some of the other useful applica- 
tions that can hi made of. our turnover curve table. 

First let us return to the basic table we made up for-* the turnover curve 
shown on the homogjraph:^ 



Year 


. % Left 


Start 


100% 


1st 


58 


1 2nd 


44 


3rd 


36 


4 th 


30 


5th 


26 


6th 


23 


7th 


21 



From this basic table, we cap compute; 

1. The probability of a new hire's lasting x years. E.g., the 
probability of a new. hire staying 6 years is 23/100 ,or 2^%. 

2. The probability of an employee, of x years service staying 
until year y. E.g., the probability of an employee vitK 3 
years staying to the end of 6 years is 23/36 or 64%. 

3. How many people have to be hired now -to have x people on board 
y years from now? ET.g., the number of hires needed to have 50 
people on board 3 years from now i« equal to 5 
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= 100 
50 36 

. V 36x f 5000 • . 

r ' . - ■ 

X = 5000/36 = 139 

4. Asstxmlng a training course costs $1000 per employee, how much 
money must be spent on training new hires now for every em- 
ployee needed on. boardT 3 years from now? E.g., the cost of 
the training divided by the 3-year retention probability is 
equal to : - 

$1000/. 36 « $277a , ' 

As you see, there are a number of different kinds of computations, that* 
can successfully be made^ from just the limited amount of dara '^n our 
basic table. ^ 

Conclusions • 

Let us conclude our discusgion of the nomograph method by summing up 
this method's advantages and drawbacks ^ 

JOn the one hand, it is easy, quick, and cheap'. The nomograph shows 
right on it the data that are needed and how to draw turnover curves. 
Anyone can use it — it requires no mathema'tical or st^atistical expertise. 
Anyone who has the ability to read a scale and draw straight lines can 
make seven year projections from simple data. 

These characteristics make it very handy to use in small groups, under 
field conditions, or by such skilled scale- vfeaders and draftsmen as 
craft s-and-trades employees or foremen. Also, since the seven scales 
can be taken as representing any equally-spaced time units — seven weeks, 
seven months, seven quarters, etc. — the. nomograph method can be used on 
shorter-term, as well as longer-term naroblems. 




Finally, the method uses data that Usually are readily available in 
organization records. Even in orgmizations without detailed personnel 
data systems, there are usually pay records showing the number and types 
of employees hired in. a given period and how many were still on the 
rolls at selected subsequent time periods. 

llhese advantages make the nomograph method very useful in many common 
types of sittiations. It is sufficient, indeed, for many practical 
situations. 

The nomograph method also has, however, some truly significant drawbacks 
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Maniial line-fitting gets very hard .if you have more than four 
points; ^ 

- When retention observations are unevenly spaced, the correct 
placement of points in between scales is hard to achieve*; 

Nomograph results ate. difficult to apply directly where the 
group's hires have been made over a span of t-ime rather than 
being bunched together; 

The nomograph curve directly applies to employees in the first 
seven years of service only; , 

" - It is hard tj^ apply nomograph curves to mixed length-of- 

service groups, such as an organization's oyerall workforce 
\ with its typical mixture of all different lengths of service; 
'i^ and v 

While the nomograph method shows the most probable fevel of 
future retention, it is d;Lfficult to estimate how reliable 
that projection is and how much variation can be expected 
from these projected values^ 



Clearly, ^?ffl.le. the nomograph method i^ both useful and instructive, 
effective staffing needs planning requires more sophisticated, more 
fle^^Jble^ and more powerful Inethods. Such methods have been devel- 
oped and are available. And it is to them that we now turn. 

y 
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^ CHAPTER 3 
ON CURVE FITTING 



During our discussion of the log-probability nomograph, you saw how 
approximation methods could be Used to fit a straight line to a plotted 
set of retention data. The line we wjr e lookitlg for there was the one 
which would a)est represent all of the plbts. The purpose of the 
statistical method described in this chapter is exactly the same as that 
used on the nomograph—f indixxg the best line*!/ 

\ 

To develop this method ^ we need to know how to: 

1. Determine the formula for a straight line from a given set of 
data points; and , . 

2. Transform longitudinal retention data so that it can be fitted 
with a straight line. 

The General Formula For A S traight "Line 

■ . ' ■'^fi ' .k — ^ 

■■ .\ ; ' 

To beg^ the discussion of a stati^tlfcaX method of fitting a straight 
line to a given set of data points, wfe must shift our attention from 
points plotted on a nomograph to points plotted on a standard graph 
(such as the one on which our original retention cur^e was plotted) * 
This graph consists of two perpendicul^^r lines. The vertical Hup Is 
called the y-axls; the horizontal ilr><> called the x-axip, 

Cone^dpr flrwt the following gr^rb? 




i/Readerq familiar with the linear least-squares curve-fitting method, 
described in this chapter, may proceed to Chapter 4. 
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The three points plotted.' on the graph cai^i be represented by the followf- 
ing table: . 



x-axls 



y-axls 



0 
1 
2 



This means, for example, that the value 0 on the x-axis is matched with 
the value 1 on the y-axis, ^We will reffil/to the values on the x-Mcis as 

x-values and to those on the y-axis as' y--values, 

f . ■ . ■ ■ 

Connecting the three points on the graph gives a line which is parallel 
to the X-axis: 




Since for each value pn the x-axis (i.e., for each x-value) the corre- 
sponding y-value on the line is 1, the formula (or equation) describing 
this line Is written: 



1 



1. 



Dte that this relation also holds true for x «= 0. It can be said that 
this line intercepts (or hits) the y-axis at 1. » 



. > 
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To generalize., j:hl8 concept, consider any /ine parallel to the x-axls and 
let the dist^ce from the x-axis to the ^olnt where the line intercepts 
the y-axis be represented by the let tier a as follows: \ 



6. 

5. 
4- 

3- 
2. 



-1 1 1 L. 



1 2 3 4 5 6 



Using the same reasoning as before, we can say that this line is repre- 
sented by the equation: . 



y = a 



The point at which a line intercepts the y-axis is called, quite natur- 
ally, its y- intercept . 



Now suppose we make a slight change in our set of points and 



use: 



0 1 

1 3 

2 5 



Then our line looks like this: 



We -still have a value for a (the y- intercept) of 1 but obviously some- 
thing else is needed • This "something else" is a factor, which measures 
the line's departure from being parallel to the x-axis''.. Or, in other 
words, a factor ^^ich measures the amount by which y differs from a. 

Consider the following diagram: ^ ' 




Ncwte that for x"= 1, y-= 3, ory«= 1 + 2= a + 2. And for x - 2, y = 5 

= ^1 + A a H- 4, These two values of y can also be written still another 

way : 

y-a + 2- a + 2(1) 
y-a+A-a + 2(2) 
Note that the values In the parentheses are the corresponding values of 

X. 

So, for X " 1 ♦ 

y«53«= 1 + 2* a+2(l) = a + 2x 
. it 

And for x 2, 

y»5«l + 4«=a + 2(2) = a + 2x^ 
Now, replacing the number 2 by the letter b, we have: 

y «= a + bx , 

This iB a generalized -Equation for a straight line. And b is referred 
to as the slope of the line, 
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Otn: example, above showed b as a positive number. But b can also -be 
negative. For example, consider the points:* 



The graph of these>point8 Is: 



0 
1 
2 



1 
-1 

-3 




The value of a is still 1. But in this case, for x = 1, 

' y=*-l = l- 2='1- 2(1) 

And for x = 2, 

y= -3 =1-4=1- 2(2) 

So in this example the equation is: '* 

y = 1 .-^ 2x ■ ^' 

And b = -2. (In our analysis of retention data we will find that the 
value of b is ^always negative.) ^ . 



To sum up then, we have: 



A general formula for a straight line is: 
y a + bx 

where 

a « the y- intercept 
b = the slope 



Once the equation for a line is known, it is possible to use that equa 
tion to solve for a value of y for each value of x. 

The Linear Least Squares Line 

Now that we have a general formula for a straight line, we want to be 
able to use it to find the line which best fits a given set "of data 
points. To do this we need a method to calculate values for a and b 
using the x- and y-values of the data points. 

• ) 

pIb an example, let*s use the points: 

X Z 

1 1 

2 3 

3 4 

You can see from t,he graph below that these points are not exactly in 
line: 



6 


p 




. 5 


r ^ 




' 4 


0 


1 


3 


m 




2 






1 


• 




J 


1 2 3 4 5 6 
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^The first step in the procedure to find the best line to fit these 
^points is to set up a work table with these headings: 

The fir^ column of the table will list a^of the x->values. The second 
columejiill^list the y-v^ues (with each given x, y pair~i.e., point- 
on thfe same line of the table). For the third column, each 3?~value is 
maltiplied by its corresponding y--value and for the fourth column, each 
x-value is multiplied by itself (i.e,, squared). After the columns are 
filled In, the values in each one are added to obtain a total ^figure for 
each column*. 

Using our sample set of points, ''a work table would look like this: 



X. 






EL 




1 




1 


1 


1 


2 




3 


6 


4 


3 




4 


12 


9 


6 




8 


19 


14 



There is a set of two equations which uses the summed values in the work 
table to calculate values for a and b, . To use these equations, we need 
a special symbol to express the sums in shorthand form. This is the 
capital Greek letter sigma CD . The symbol is read as "sum of." Thus, 
Sx is the sum of all the x-values, ly is the sxim of all the y-values, 
£xy is the sum of all the xy-values, and 1x2 ig ^he sum of all the x2^ 
value^ 

We also need a symbol to represent the number of known data points and 
we will denote this with the capital letter N. In the example, N equals 
3 (since there are 3 pajjits) . ^ 

The equations that we arte going to use to find a and b are: 

Xy Na + b^ 
vXxy = aSx + hSx^ 

(These equations are known as the "normal equations" and their derivation 
may be found in any basic statistics text.) " ^ 



r 
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To use these equations, values from the work table are substituted at 
the appropriate places. From our sample table we have: 



2x2 
N 



6 
8 
19 

3 I 



Substituting these values into the normal equations, we get: 

8 = 3a + 6b (1) 



and 

19 « 6a + 14b 



(2) 



Now we have two equations with two unknowns (a and b) and to solve them, 
we^ use the technique of simultaneous solution. For those whose memory 
of this technique is ^i4ja, what follows is a short refresher cooirse. 

The first step in simultaneous solution of two equations is to eliminate 
one of the unknown terms. ) This can be done if the coefficients of 
(i.e., numbers preceding)/ one of the unknowns are made equal in both 
equations. This will allow these terms to cancel out when one equation 
is subtracted from the other. In our example, this can be done easily 
by multiplying both sid*s of the equation (1) by 2. When we do this, we 
get: 



16 - 6a + 12b 



(3) 



Writing equations (3) tod,^(2) t^ether, we have: 

16 = 6:a + 12b - (3) 
19 = 6a + 14b (2) 

If equation (2) is subtracted from equation (3) , the a-terms cancel out 
and we can solve for b: ^ 

16 » 6/+ 12b 
019 ^a +6l4b . ^ 
-3 = ^ 
^3/-2 = b 
1.5 » b 

fo get the value of we can substitute the computed value of b into 
either equation (1) or (2). Let's use the first one: 
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. 8 - 3a + 6b ' (1) 

^ 8 - 3a + 6(1.5) 

8 - 3a + 9 

8 - 9 ,- 3a 

-1 - 3a 

' -1/3 - a 

-0.33 a 

So our straight line equation is: 

a 

y = -0.33 + 1.5x 

What kind of line does this give us? We can find out by substituting 
each value of x into the equation to see what values of y we obtain. 

For -x = 1 , . 



For X " 2, 



y •= -0,33 +1.5(1) 

- -0.33 + 1.5 

- 1.17 



-0.33 + 1.5(2) 
-0.33 + 3 
2.67 



For X - 3, 



y - -0.33 + 1.5(3) I 

- -0.33 +4.5 

- 4.17 _ 

Now we can plot these y-values on the same graph as our three points and 
see how the line fits the points: 



Thie process which we have just gone through iiLs known, as the linear least 
squares technique . And th^e line -we found is the least squares line . We 
will learn more about the properties p£ this line later. 

There is an alternative to using the normal equations to find a and b. 
This alternative is known as the linear fit algorithm and consists of 
two equations which can be directly solved for a and b. They are: 



b = SxSy - iSxy 
and 

a = r- ti£x 
N 



These formulas are simply the result of directly solving the normal 
equations simultaneously for a ahd b. 

Using these, equations and our sfimple work table, we would get: 

b = (6) (8) - 3(19) 
(6)^ - 3(14) 

= 48 - 57 
36 - 42 

= zi 
-6 

« 

b - 1.5 



a = 8 - 1.5(6) 
3 



= 8-9 

» -1/3 
a = -0.33 

And these are the same answers we got before. 
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Once the Wast squares equation has been found, it can be used to calcu- 
late estimated y-values for other than the known x-values. For example, 
let s find the y-values associated with x-values of 1 1/2 and 5. 

For .X - .1.5, 

y = -0.33 t 1.5(1.5) 
-= -0.33 + 2.25 

- 1.92 

For X - 5, 

y = -0.33 + 1.5(5) 

- -0.33 + 7.5 

- 7.17 

^ ■■ " , . 

This capahility of using the least ' squares equation to get the y-values 
given by the least squares line is known as itftratioa and will come in 
handy. later. 

> » 

Properties of the Linear Sqixares Line 

^'W. stated purpose at the beginning of this chapter was to find the best 
line to fit a given set of data points. This type of approach was 
needed since, in almost all cases, no one straight line will pass through 
every given point i 

The linear least squares line is that best line. It is the one line 
which, on the average, comes closest to each of the given points. The 
mathematical definition of this line is: 

The line ar least squares line is that line which minimizes the 
sum of the squared differences from the given points to the 
line. 

^'.j. iff 

For any given set of ^ata pointi, there ia only one line which will tit 
this definition.- Thus, ;, , 

There is one and only pn^" linear le^st squares IJ^e for, a 
, given set of data ^points. ^ ^ , 

To illustrate what is meant Jjy "di|;ferences," consider the example from 
the previous section. There, w^ fit: a linear least squares line to the 
-points: . • ' *• ' . ■■' 




When ve Iterated the resulting equation, ve got these values for the 
line (using the syinbol to represent values on the line): 

X 

1 1.17 
^ 2 2.67 ^ 

3 4.17 

The differences 'referred to In the definition of the ^linear least squares 
line are the dlfferei^ces between the y-values and the y^-values — -I.e., 
y_yl — at each value of x. ■ 

Visually, these differences can be represent^ by the vertical distances 
marked by brackets on the following graph: 



6 




5 




k 




3 








2 


o / 


1 






^ t J 1 1 1_ 




1 2 3 4 5 6 



Mathematlcalj^^^hese^lf ferences are: 

•vi^^,;3r5|*^A- 1 - 1.17 - -0.17 

For X ■>■ 2, 

y - yl = 3 - 2.67 - 0.33 

For « - 3, 

_ yl » 4 _ 4.17 - -0.1/ - 

These differences are also called deviations and are symbolized by the 
srmall letter "d." 
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The "sum of the s^red" differences (or deviations) referred to in the 
definition of the least squares line are calculated by: 



Squaring each of the above deviations; and 
Adding them together. 

If we wuld do this using our example, we woiild get: 
(-0.17)2 + (0.33)2 + (-0.17)2 ^ o.l667 

The definition for the linear least squares line refers to this figure 
(0.1667). Xhe^ linear least squares line is that line for which this 
number is the smallest. In other words, any other line fitted to the 
same poi nts would give a larger number for thg sum of the squared devi- 
ations . ' '' '' ' ' 

We can symbolize the sum of the squared deviations as Xd2 (remember tlmt 
X= sxjm of). ^ 



These deviations can be used to calculate the degree to which the given 
^ta points are scattered, around the linear least squares line/ If we 
. dj^de the sum of the squared deviations (2d2) by the nlanber of known 
data points less one (N-1) , we get the variance. This statistic is a 
measure of the discrepancy between thej^ actual y-values and the y^- 
values. 



Another such measure can be found by taking the square root of the vari- 
ance. The result of this calculation is known as the standard error of 
the esti mate . (This measure may also be referred to as a standard devi- 
ati<0ti.) 

The ite^hematical formulas for these two statistics can be summarized as 
folloWa; ' 



Let s «= 

N « 

Then* 
s2 - 



standard error 
variance • 

the sum of the squared deviations from 
the linear least squares line 
the number of known data points 




To aid in the calculation of thesp measures, it is helpful to 
work table using the following headings: 



t up a 



(The first t^o columns will contain the x- and ^-values of the given 
data points. The third coltunn will list the line-values for each x. 
Colunai'4 contains the differences between y and yl and column 5 lists 
these differences Squared.) 



Usltig our example, such a work t^ble would look like this; 



• 


X 


z 




d 


dji 


N - 3 


/ 

2 
3 

I 
/ 

^. ^ 


1 

3 
4 


1.17 
2.^7 
4.117 


-0.17 
0.33 
-0.17 

2d2 


0.0289 
0.1089 
0.0289 
= 0.16-67 


Then, 


\ 

t 














0.1667 

2 , 




0.0834 






y N-i 


VO.0834 




0.2887 





^Both of these statistics are in the same units as the y-valuBS. 

Other properties of the linear least squares line can be stated using 
the variance and the standard error: u ■ 

t) 

1, linear least squares line is that line for which the 
>^fciance between the given points and the line is at its 

smallest. Conversely, the variance from the given set of 
points t© any other line will be larger. 

2, The Less scatter th^re is between a given set of points and 
its least squares line, the smaller is the value of the 
standard error. This property can be simply stated by saying 
that the closer the points are to the line, the less error 
there will be. 

3, When the points being fitted represent a truly linear rela- 
tionship betweeti the x- and y-values, the deviations about the 
linear le&st squares line form a normal distribution. 



This third point requires some elaboration. Fir&e^^t should be noted 
that not every set of points is linearly-related— i.e. , not every set of 
points is best fitted by a straight line. Some will be better fitted 
directly by a curved line. Others will require some type of data? trana- 
. formation to bring them "Closer to a linear relationship. (One example ' 
of such d^ta transformation will he givSn in the next chapter.) The 
choice of a fitting method should be the result of careful study of the 
known data. 

However, when a set of points is linearly-related ^ approximately 68% of 
the deviations from the* linear least squares line wili-have a value - 
which is between +s (s - one standard error) . Approximately 95% of them 
will be between +2s and approximately 99% will be between +3s. This, 
type of an arrangement of data is known as a normal distribution and we 
will learn more about it in later chapters. 



Now it is time to take what has been learned about a least squares line 
and actually analyze retention data. ] 



CHAPTER 4 



THE NATJIRE OF THE RETENTION gURVE 



Data Transformations 

We now want t^^...apply the least-squares straight-line technique to longi- 
tudinal retention data. To do this, we need to bridge the gap between 
the retention curve in Figure 4-2 and the retention line on the nomo- 
graph (Figure 4-1) on page 58. 

In general, when attempting to apply least Squares techniques to a 
curved line, you first determine the mathematical equation that best 
represents the curve in question. Then it is necessary to determine 
what can be done mathematically to this equation to get data which are 
linear. 

This type of sequence was followed with the longitudinal retention 
curve. We tested various known types of curves against the retention 
cflrve.. From these tests, we learned that the retention curve is best 
represented by what is known as a log-pyobabil ity curye equation. 

This discovery in itself left us with a rather complex mathematical 
equation. Fortunately, there are ways to convert this type of equation 
into a straight line form. These "ways'* consist of what are known as 
data transformations .^ 

•» 

A data transformation involves performing the same mathematical opera- 
tion on each data item (such as taking the square root, squaring, taking 
reciprocals, etc.). A transformation may be performed on both the x- 
and y-values or on just- the x- ot just tihe y-values. A linear least 
squares line is then fitted to the transfqi^rmed data points, the equation 
for the line is iterated and the resulting' line-values are changed back 
to theiif original form. K 

Another look at the line on the nomograph suggests :ghat' there are, trans- 
formations whidh can be made on retention data to bring out a linear * 
relationship. In this case, both the x- and y-values are Involved, 

As was indicated in the nomograph chapter, the x-and y-values for; reten- 
tion data are: . - 

X : time since hire 

y : % remaining from an original group o^. hires 
at time x . , , 

The name "log-probability" suggests ^he data transformation^ which can 
be made on these values. • 



Table 4-1 ' 



TABLE OF LOGARITHMS 



YEAR LOG (10) 

0.1 -1.00000 
0.2 -0.69897 
0.3 -0.52288 
0.4 -0.3979* 
0.5 /-0. 30103 
0.6 -0.22185 
n 7 -6 1-^*90. 
0.8 -0.Q9691 
0.9 -0.0*576 
1.0 0.00000 


YEAR ■ LOG (10) 

5.1 0.70757 

5.2 0.71600 

5.3 0. 72428/ 

5.4 0.73235 

5.5 0.7*036 

5.6 0.7*819 

5.7 -^0.75587 

5.8 0.763*3 

5.9 0.77085 
6.0 0.77815 


YEAR LOG (10) 

10.1 1.00432 

10.2 1.01)860 

10.3 1 .bl284 

10.4 1.01703 

10.5 1.02119 
JO. 6 1.02531 

10.7 1 .02938 

10.8 1.033*2 

10.9 1.037*3 
M .0 1 .0*139 


1.1 ' 0.0*139 

1.2 0.07918, 

1.3 0.1139* 
K* 0.1*613 

1.5 0.17609 

1.6 Q.20*J2 

1.8 0.25527 

1.9 0.27875 
2.0 0.30103 


i.V , o.7853i 

6.2 0.79239 

6.3 0.^993* 
6.* 0.80618 

6.5 0.81291 

6.6 0.8195* 
6 7 0 82607 

6.8 0.83^51 

6.9 0.83885r 
7.0 0.8*510 


11.1 1.04532 

11.2 1.04922 

11.3 1.05308 

11.4 1.05690 

11.5 1.06070 

11.6 1.06**6 

11.7 1 .0681 9 

11.8 1.07188 

11.9 1.07555 
12.0 1-07918 


2./1 0.32222 

2.2 0.3*2*2 

2.3 ' 0.36173. 
2.* 0.3802W 

2.5 0.3979* 

2.6 0.*1*97 
2 7 0.*3136 

2.8 0.**716 

2.9 O.*62*0 
3.0 0.*7712 


7.1 0.85126 

7.2 0.85733 

7.3 0.86332 
' 7.* ^ 0.86923 

7.5 0.87506 

7.6 0.88081 

7.7 0.886*9 
.7.8 0.89209 

7.9 0.89763 
8.0 0.90309 


12.1 1.08279 

12.2 1.0iB636 

12.3 V 1.08991 
12** 1.093*2 

12.5 1.09691 

12.6 1.10037 

12.7 1.10380 

12.8 1.10721 

12.9 . 1.11059 
13.0 1.1139* 


3.1 0.49136 

3.2 0.50515 

3.3 0.51851 
3.* 0.531*8 

3.5 0.5**07 

3.6 0.55630 

3.7 ' a.56820 

3.8 0.57978 

3.9 0.59106 
*.0 0.60206 


8.1 0.908*9 

6.2 0^91381 

8.3 0.91308 
8^* 0.92*28 

8.5 0.925*2 

8.6 0.93*50 

8.7 0.93952 
A.8 . 0.9*448 
8.9 0.94939 
i.O 0.95424 


13.1 1.11727 

13.2 1.12057 

13.3 1.12385 

13.4 1.12710 
13-5 1.13033 

13.6 1.13354 

13.7 1.13672 
13^8 1.13988 
13.9 1.14301 
14.0 1.14613 


*.1 0.61278 
M.Z 0.62325 
*.3 0.633*7 
*.* 0.6*3*5 
*.5 0.65321 

4.6 0«66276 

4.7 0.67210 

4.8 0.68124 

4.9 0.69020 
5.0 0.69897 


9.1 0.95901 

9.2 0.96379 

9.3 0.96848 

5.4 0.97313 

9.5 0.97772 

9.6 0.98227 

9.7 0.98677 

9.8 0.99123 

9.9 0.99564' 
10.0 1.00000 


14.1 1.14922 

14.2 1.15229 

14.3 1.15534 

14.4 : 1.15836 

14.5 1.16137 

14.6 ^ 1.16435 

14.7 1.16732 

14.8 1.17026 

14.9 1.17319 
15.0 1.17609 



- 46 - 



v%2 



Time Since Hire; The horizontal distances on the nomograph Indicate 
that the transformation made on the x- (or time) values altered the 
, spacing between successive years. Notice how the large vertical lines 
get closer and closer together as the number of years gets higher. This 
type of spacing Is called a logarithmic progression . 

Without going into tl|e gory mathematical details, we can just say that a 
logarithmic progression Is formed by taking the logarithms of the time 
values. Every number greater than zero has its own logarithm. The 
value of the logarithm for any number can.be found in tables in most 
math books, by pressing a key on a calculator, or by using a standard 
computer function. 

For use in retention analysis, you will probably neecj access to only a 
s^all range of logarithmic values. Most of these can be found In Table 
4-1. Intermediate values can be found by interpolation. 

The standard notation for the logarithm (t) of ^a number r Is: 

^ t = log r " 

The Importance of logarithms in retention analysis is due to the effect 
thlBy have on the distances between numbers. To show this, consider 
first the logs of 1 and 2. 

log 1 - 0 

log 2 = 0.301 

Thus, the distance between 1 and 2 (in logarithms) is 0.301. In addition, 

log 4 = 0.602 

Thus, the logarithmic distance from 2 to 4 Is; 
* 

log 4 - log 2 = 0.602 - 0.301 = 0.301 

But this is the same as the logarithmic distance between 1 and 2. This 
*is why on the nomograph, the horizontal distance between "1st year" and 
"2nd year" equals the horizontal distance between "2nd, year" and "4th 
year." 

* • . ■ ■ 

Plmilarly, the logarithmic distance from 10 to, 100 is the same as that ^' 
from 1 to 10. What thijs means is that the logarithms of successively 
larger numbers are closer and closer together. 

\ 

It is also possible to transform the logarithm of a number back to the < 
nun^ber itself. This can be done by using a table of logs backwards. 
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Mathematically, If 

' t « log r 
then, ^ 0 

r » IQt 

(i.e., 10 raisedVtQ the; power of tJ This is true because we are using 
at are known as Vbase JO" logarithms O 



What happens when^ turnover data are plotted, on a standard graph using a 
logarithmic scale on the* x-axis? ^ 

Consider again the sample nomograph data. You will remember that, when 
plotted on a regular graph, the turnover curve looks like this: 'jy 



1007or 




Plotting, this same data with a logarithmic scale on the x-axis, i.e., 
with the tisle values in logsl.' (and no change on the y-axis) , we have: 



100%^ 



50% 




.4.5 6 7 



17 The log 9cale on the x-axis starts with one (1) since the logaifithm 
of 1 is zero. 



When retention data are plotted in this manner, their relationship 
ohyiously comes much closer to linearity. However, we can get an even 
better linear relationship by also making a transformatidn of the v- 
values (percent retained). ^ . ^ 

Percent Retained; The second transformation uses, certain properties of 
l ookrilte''thl8^^*'^ A normal curve (also known as a bell cirve) 



This type of a curve has a number 
S3nOTiiBtrlcal about a mldlin^f* drawn 
to nbe K-flxlp, MV-f* Rp: 



of special properties." First, It Is 

from It (? b1gh©^t poln^ r^^p^r^d^^nlArly 



""^^ 5^ ^""^ «P^^^« under the normal 

probability curve Into two exact halves. Thus, half the curve Is to the 
left of the mean and half Is to the right. 



Second, distances from the mean are measured In standard d eviations." A 
standard deviation Is the same kind of measure as the standard error 
which was discussed In the previous chapter. The standard error Is a 
measure of the scatter about a linear least squares line. The standard 
deviation Is a measure of the scatter about the arithmetic mean of a 
group of values. ■ 
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If a given group of values is what is known as normally distributed, the 
plot of the percent of tlie frequency of occurrence (or probability) of 
each individual deviation from. the mean will form a normal curve. For 
example, suppose we have the following group of values: 7, 8, 8, 9, 9, 
9, 9, 10, 10, 10, lb, 10, II, II, II, 11,^12, 12, 13. The mean (or 
average) of these values is 10. The deviations from the mean are found 
^r-v by subtracting 10 from each value. Since most of the values occur more 
than ojQce, we get the following set of deviations and frequencies (of 
each ' devlat ion) : 



< 



Deviation 



3 
2 

-1 
0 
1 

2 
3 



1 ^ 



Number of 


Percent of 


Occurrences 


Occurrence 


1 


5.26% 


2 


10.53 


4 


21.05 


5 


26.32 


4 


21.05 


2 


10.53 


1 


5.26 


19 


100.00% 


onrop using a bar p'^r'' 


^1 «. J 1 V la * 


1 






-mm 

V 

You can seo that w>'en n sTnooth cwrve drsvn c^nj^porints thf mIrlroiDte 
of the top of each bar. a normal curve appears. 

The standard deviation of these devia tion values, using the same formula 
that was used for standard error ^^/Yi-l) is 1.53. It was stated pre- 
viously that distances from the mean of the normal curve are measured in 
standard deviations. If we re-plot the normal curve above marking off ^| 
standard devlatiou dtstance". haver 
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Note that the values to the left of the mean are negative and those to 
the right are positive. Wien Chalking about or measuring x-axis values 
for a normal curve, we artf concerned with the number of Itandard devia- 
tions from the mean. Foy^^^^ample, we might want to know the curve value 
for a point Iho^e from the. mean, or -.37a , or Sa's, etc. 

"As with the standard error^^^oercentage values can be associated with 
standard deviation values, in this case, the percentage values refer to 
the percent of the area tinder the normal curve which is between perpen- 
dicular lines drawn from two 'standard deviation values on the x-axis to 
the curve. The percent of area under the entire curve, from the leftmost 
to the rightmost point, is 100%. 

The percentage of area between (a) - a and + a is 68%, (b) -2 a and +2 a 
is 95.5% and (c) -3<r and +3 a is 99Z. These percentages are the same 
as those between the same range's of the standard error. (Thus, the 
deviations from the linear least squares line are normally distributed.) 
These percentages can also be expressed as probabilities. Foij example, 
the probability that a deviation will fall between +»a is 0.69. In 
other words, there are 68 chance? out of 100 that a'^deviation 5l*ll fall 



Table 4-2 

•NORMAL CURVE AREA CONVERSION TABLE- 



PERCENT 
REMAINING 



STANDARD 
DEVIATIONS 
FROM MEAN . 



PERCENT 
REMAINING 



STANDARD \ 
DEVIATIOHS^ 
FROM M^AN 



PERCENT 
REMAINING 



STANDARD 
DEVIATIONS 
FROM MEAN 



95 

92 
91 



1 .SkkBS 
1 .SS'*?? 
1 .47579 
1 .40507 
1 .34076 



65 
64 
63 
62 
61 



0.38532 
0.35846 
0.331 85 
0.30546 
0.27932 



35 
34 
33 
32 
31 



•0.38532 
■0.41 246 
■0.43991 
-0.46770 
-0.49585 



90 
89 
88 
87 
86 



1 .28155 
1 .22653 
1 .1 7499 
1 .12639 
1 .08032 



60 
59 
-58 
57 
56 



0.25335 
0.22755 
Q. 201 89 
0.1 7637 
0.1 5097 



30 
29 

28 

2/ 
26 



■0.52440 
■0.55338 
•0.5-8284 
■0.61281 
•0.64335 



85 
84 
83 
82 
R1 



1 .03643 
0.99446 
0.9541 7 
0.91 537 
0.87790 



55 
54 
53 
52 
51 



0.1 2566 
0. 1 0043 
0.07527 
0.05015 
0.02507 



25 
24 
23 
22 
21 



■0.67449 
■0.70630 
■0.73885 
■0.77219 
■0.80642 



80 
79 
78 
77 



0.841 62 
0.8O6'*2 
0.77219 
0.73885 

n . 7n<; 70 



50 
'♦9 
'♦8 
'♦7 



0.00000 
■0.02507 
■0.05015 
-0.07527 

0 . 1 no43 



20 
19 
18 
17 
16 



-0.84 162 
-0.87790 
-0.91537 
■0.95417 
0. 994^46 



75 
7«» 
73 
72 
7 1 



0.67'*49 
0.64335 
0.61 281 
0.5828'* 
0.S5338 



'♦5 
4<* 
'♦3 
'♦2 



0. 1 2566 
0.1 5097 
-0.1 7637 
-0.20189 
-0.2275'* 



15 
14 

13 
1 2 
1 1 



1 .03643 
1 .08032 
1 . 12639 
1.17499 
1 .22653 



70 
69 
68 
67 
66 



0.52440 

0.49585 
0.46770 

0.43991 
0.41246 



'♦O 
39 
38 
37 
36 



■0.25335 
-0.27932 
-0.30548 
■0.33185 
-0.35846 



Id 

-9 
8 
7 
6 
5 



1 .281 55 
1 .34076 
I .40507 
1 .47579 
1 .55477 
1 .64485 
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between + a . 

In addition, we can associate with every distance (or standard devia- 
tion) value from the mean (plus and minus) a unique percentage value 
which represents the percent of area under the normal curve from the 
leftmost point to a line drawn perpendicular to the curve from the given 
standard devia:tion value. For example, the percent value associated 
with tlie mean (or 0 a ) is 50% (since exactly half of the curve lies to 
the left of the mean). Similarly^ the percent value associated with + a 
is 84%. The percent value being measured is shown in the shaded area: 




In the same way, one specific percent value can/ be associated with each 
standard deviation value. And, conversely, one st andard deviation valu^ 
can b e associated w ith each percent value 
way «T^Rf orm^^tlon . ) 



Tbip Is our second data tr«nsf orma t i on ! transf-rmdng "percent remf^-fnl r 
I.e., p'rc^nt of employees sti^l re"'ain*ng f'oro t'^e starring gronr 

r>Mfnhor r> f oM^nHor^^ <1 o vH n ♦ < or> fa f » f ho moon ^^f o normal mr"/-." 

As vith loKaritbms, tb<=*se values r^n be found in matheniat-i cal tables- A 
S5imple one found In Table ^4-2. This table requires tnterpolation ^ • 
Intertnerllate values. More detailed tables c^in be ^'ounH iT> statistics 
boo^^B. There is also ^ ma^hematlraI formula whicb may be u«*><^ f<^<^o 
cal AnaTyfti« for T.OOFKO in tbp Apppndix jserfion.) 

Using the Normal Curve Area Conversion Table (Table A-2), we can see 
that a percent remaining figure of 85% corresponds to 1.036/13 standard 
deviations, 80% to O.8/4I62 a , etc. 

Look again at the vertical lines on the nomograph (p. 16) and consider their 
scaling. The percents on the scale are spaced accordinK to their corre- 
sponding standard deviation distance from the mean (which is represented 
by the 50% mark). 
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For example, +1 a corresponds to 84.1% and \ o x.o 69.1%. If a mark is 
made at each of these values, It can be seen that the distance from the 
84.1% point to the 69.1% point is exactly the same (i.e., ^ <7 ) as the 
distance from the 69.1% point to the mean (50%). These are not equal 
distances in straight percent since: 

•,84.1% - 69.1% = 15% 
^ 69.1% - 50.0% « 19.1% 

The transformation of "percent remaining" to niimber of jBtandard devia- 
tions from the mean of a normal curve, when used with the transformation 
of time to log of time, gives the straight line which can be found on 
the nomograph. ^ 

Now we have the two data tranformations that were needed: 

1. Years of service is transformed into the logarithm of years - . 
of service ; and 

2. Percent of employees retained' from an original group of hires 
is transformed into number of standard deviations from the mean 
of a normal probability curve. 

Some Comments About Retention Analysis 

These comments will cover four areas: 

1. The collection of retq^tion data; 

2, The reasons behind the close relationship b^^t^een rptention 
data and the log-p.robabiHty technique; 

The condltibn under which the proiectl*-'' ^ ' ^*^^'^^^f^v 
equ«ition femal'^s '^ali'^; and 

A. The ront-ly^uHv ^>>o r e f *>r> l f <m> ^iirvo 

Data Collection 

It has been stated that retention datfl are collected hy following a 
group of h"Jres ov*»r t1m<a «rd counting thp number left After cert^^n 
lengths of time. 

The group that is followed (the "original group" or "cohort") is composed 
of employees hired during a specific span of time. This span should be 
limited to one year or less. The "certain lengths of time" could be any 
standard time units (years, months, days) after the end of the original 
time span. Since the original group will contain employees with different 
lengths of service at the end of the original tiine span (from 1 day to 1 
year), there are several options available for assigning values to the 
time units. 
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For example, consider a group of employees hired during a given fiscal 
year ending September 30. And suppose that retention data Ipof ^this group 
are recotded on succeeding September 30th' s. The time valu^S^ fot thes.^ - 
data points may be: y'yi 



(a) 



1.0 year, 2.0 years, etc. (calculated from September 30th to 
September 30th); or 

(b) 1.5 years, 2.5 years, etc. (assuming an even distribution of 
hires during a year and using the midpoint; i.e., 0.5); or 

(c) 1.x years, 2.x years, etc. (where x is a factor based upon 
specific hiring patterns, such as more hiring done at the end 

/of a fiscal year); or 

(d) r.y years, 2.y years, etc. (where y represents the actual 
average length of service of the original group at the end of 
the hiring time span). 

the number to the right of the decimal point is known as the time aver- 
aging factor. When this factor is greater than 0, it represents the 
distribution of hiring into a group during a time span. 

Different occupations have different turnover /retention patterns. Some 
occupations, such as Internal Revenue Agent, Foreign Service Officer, 
etc., are unique to government. This type of occupation does not exist 
in the private sector. Consequently, since there are no places other 
than government where the employee can go and still remain in his or her 
occupation, the turnover in thes^ occupations tends to be low. Other 
occupations, such as Clerk/Typist, are widely distributed both inside 
and outside of govertitnont. Thus, th^re ^tb many possible employers for 
people in these jobc. ron«eniiont] v. t-ho mmover -fn thf»«« omipPfHonc 
tends to be h^gh. 

These differences in turnover patterns among occupations require that 
retention data be collected separately by occupation. It may be possible 
to combine occupations with similar turnover patterns at a later stage 
of retention analysis. However, the original data should still be 
collected separately - 

It may also be desirable to collect retention data by grade-at-hire 
within an occupation since there may be some variation in the turnover 
patterns of employees in different entry grade levels (e.g., GS-5 vs 
7) • 
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Why Retention Is A Log-Probablllty Function 

Our analysis of longitudinal retention data led us to the use of a log- 
probablllty equation. Why does this particular equation fit retention 
data so well? 

The concentration of turnover In. the early years of service suggests 
that turnover behavior Is exhibited logarithmically over time. Aa with 
the values of logarithms, the figures, for percent retained from an 
original group get closer together as the number of years since hire 
gets higher. 

A logarithmic progression over time is also characterized by equal move- 
ments during equal logarithmic times. For example, consider the data 
for the printed line on the nomograph ^Figure 2-2 > 16). 

Losses from 1st year to 2nd year = 58% - 44% = 14% 
Losses from 2nd year to 3rd year 44% - 36% = 8% 
Losses from 3rd year to 4th year = 36% - 30% = 6% 

Note That the percent lost from the Ist year to the 2nd year (14%) is 
equal to the percent lost from the 2nd year to the 4th year (8% + 6% 
14%), (And remember that the logarithmic distance from 1 to 2 equals 
that from 2 to 4.) This also suggests that turnover behavior is dis- 
played lt)gar Ithtnically over time. 

The effectiveness of the second, or "probability," tranf ormatlon relates 
to the distribution of attitudes toward work whic^^ Is dlsplay**d In tror^^ 
groups. ThlB di Rtribll^ion tends to form ^ normal prohpMlifv curve. 

Each person's attitude t^^ward his or h^r job Is made up of literally 
hundreds of element^ and combinations. For oven « Sm'^ll group, t-hen, 
the ♦'ofal number of «uch elements will r«>ach into th^ thousands- For 
only a fe^ emrloye«»e are these attitudes lively to be extremely negative 
Likewise, for only a few employees ere they likely to be extremely posl 
tlve. By far the llkell*>8t Is that, an employee's' set of work Attitudes 
will tend to concentrate toward the midpoint between the^e two e^tretne*^ 
(negative and positive), where the positive ar^d negative factors are 
more nearly in balance. This distribution of einrloyeo work nrritydpR 
tends to form ^ nomi;^! probability curve. 

In terms of group losses we might express this by saying that In any 
given group of hires there is likely to be: one small group that will 
hate the job and want to quit immediately; one small group that will 
love the job and never want to leave; but the bulk of the group of 
hires, who have some positive and some negative feelings about the job, 
will be bunched up in between. Their turnover will likely be neither 
early nor late, but spread out over a substantial period of time. 
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Condition For. Projection Validity 

For log-probability projections of longitudinal retention data to remain 
valid it is necessary that the organization's internal situational Caee?-^ 
tors affecting* a group^s. retention behavior during the Initial period of 
empirical observation continue substantially unchanged throughout the 
period of the projection. 

This condition involves the total working situation within a given 
organization. As long as this working situation does not go through any 
"^i^^^l^^'^Ses (such as a large reorganization, a radical program change, 
etc.) during the data collection and projection periods, turnover patterns 
will not change., However, major changes in the working situation will 
be followed by changes in employee turnover. This is because such 
changes affect the mix of positive and negative work attitudes within 
the groups touched by the changes.- Some employees who felt positive 
toward the previous work situation will feel negative toward the new one 
and vice versa. After an initial period of adjustments, turnover will 
again settle into a log-probability pattern albeit a somewhat different . 
one than before. 

Continuity of a Retention Curve 

One of the most interesting characteristics of the retention curve is 
its relative independence of external economic «nd political happenings. 

For exainple, one srudy of retention followed a group of hires into the 
Federal sf»rvlce in the most populoupr professional, fldm-i.nistratlve , an<^ 
technical occupationp during FY-63. Th<^ same group of employees was 
followed Tor 7 yenrs (through 1970) ^r>^ the percent re^alned was re 
cordea at the end of ej^ch ^ Ikc year Th^se rercents w^^re pl^'t^ed on 
nomogr 'ph ^Figur'^ A--^) a'»d t^'^ »Tr* Mt»o wn<7 nnf»rt>nfo,7 noiT^pr o m»^<>» 

Figure 4 •? sh^^wR the same ''ata plotted arithtnet icall v . A curve was 
fitted to the points uslne th*' log-probaV^llIfy leasf "quar*»s teohxiiqu** 
Again, notice how closely the curve fits the points. Note that at no 
time during the seven-year period do the actual values differ from the 
f it^ted_ curve more than 1% . 

This means that the retention curve for this group continued undlsrupted 
over the entire seven-year period. But this was an era when the private 
economy was experiencing substantial ups and downs. It was also the era 
during which the number of Federal employees first greatly Increased then 
began to decrease again due to the situation in Viet Nam. These were 
truly major changes ye,t neither of these had any effect on the retention 
curve of the group. 
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1 
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2 


74 54 


75 15 


3 


70 34 


69 81 


4 


65 35 


65 74 


5 


62 34 


62 44 


6 


59 97 


59 68 


7 


57 35 


57 30 


10 


(PfOjected) 


51 72 
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From theseJand similar findings, we conclude that log-probability 
turnover curves, once established, are for practical purposes non- 
responsive to external economic and political Influences. 



CHAPTERS 

FITTING AND USING THE LOG--PROBABILITY CI 



Manual Calculation of the Log-Probablllty Equation 

Now ^ that we know about the necessary data transformation and data 
collection techniques, we are actually ready' to use the method of least 
squares to fit a log-probablllty turnover curve, (A more detailed 
version. of this process' may be found In the appendix Manual C alculation 
of the Log-Probablllty Curve ,) ^ " 



As an example, let's use the following data: 

Years Since 
Hire 



1.5 

2:5" 

3.5 

4.5 



Percent 
Retalnedl. / 

83.2% 
75.6% 
70.4% 
65.6% 



The graph of this retention curve looks like this; 

100% 



The first step In fitting these data Is to convert them Into their 
linear form using the data transformations discussed in Chapter 4. The 
"Years Since Hire" can be transformed directly into logs using Table 4- 
1. From this table we get: 



TT 



Note that throughout this and the following chapters an employee is 
considered /'retained" only if the employee continues to be in the 
same occupational group for which the analysis is made. Any employee 
who leaves thia group in arv^ wa^ (quit, change to other occupation, 
etc. )^ Is a "loss," even if that employee continues to work in the 
same organization. 
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log 1.5 - 0.17609 

log 2.5 - 0.39794 

log 3.5 - 0.54407 

log 4.5 « 0.65321 



The second transformation — fvbm percent remaining to number pf standard 
deviations from the mean of the normal curve — can be done by using the 
Normal Curve Area Conversion Table (Table 4-2, page 52) and some linear 
interpolation. 

Linear interpolation is simply a method to find a valujB'for a number 
which falls between two entries in a table. For example, what standard 
deviation value corresponds to 83.2% (our first data point)? Now, 83.2% 
is two-tenths of the way between 83% and 84%. So we^want to find- the 
standard deviation value which is two--tenths of the way between 0.95417 
(83%) and 0.99446 (84%). To do this, we can multiply the difference 
between, these two values by 0.2 and then^add this difference to the 
smaller «andard deviation value (0.95417). If we do tl4s, we get: 

(1) 0.99446 - 0.95417 « 0.04029 
'(2) (0.2) X (0.04029) = 0.00806 
' (3) 0.95417 + 0.00806 = 0.96223 

Thus, the standard deviation value corresponding to 83.2% is 0^.96223. 
Continuing in this marihiBr, we get: ^ . ^ 

Percent Standard 
Retained Deviations ' 



83.2% 0.96223 

75.6% 0.69358 

70.4% 0.53599 

65.65: 0,40160 

Now we have a set of x- and y-valuef? and can set up a work table such 
the one described in Chapter 3, To help keep the^ dat^ r^latiotishlps 
clear, we will expand the work table include both the yeAtP p-Ince 
b-fre «nH perront: retained valuef?, 

i 

Log -Probablllty Work Table 



Years Log Percent Stan. Dev. 

Since Year Retained From Mean 

Hire (x) (y) (x)(y) x2 

1.5 ' 0.17609 83.2% jf 0.9^223 0.16944 0.03101 

2.5 0.39794 75.6% . 0.69358 0.27600 0.15836 

3.5 0.54407 70.4% 0.53599 0.29162 . 0.29601 

4.5 0.65321 65.6% 0.40160 0.26233 0.42668 " 

N=4 1.77131 ' 2.59340 0.99939 0.91206 
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The next step is to substitute values from the work table 
(a) the normal equations or (b) the linear, fit algorithm. 

As a reminder, the normal equations are: • : 

Sy = Na + bSx 
2xy « aSx + bix2 

From the work table, we have: ■ , ' ' 

2x = 1.77131 . 

ly = 2.59340 
^2xy = 0.99939 . ' , * " 

*ix2 = 0.91206 • ' ' 

N = 4 

I 

Substituting these values into the n,oAnal equations gives 

2.59340 = 43"+. 1.77131b 
0.99939 = 1.77131a + 0.91206b 

Solving these equations simultaneously gives: 

a <=. 1.165275 
b = -1.167328 

And the log-probability equation is: 

r*- y = 1.165275 - l.l67328x 

The linear fit algorithm is: > • 

^ " K£xy - SjtSv 
NSx 2 - 

[ 

a = 2y - blx 

N . 

Substitution results in: 

. ' t = 4(0.99^39) - (1.77l'3lj(2. 59340) 

4(0.91206) - (1.77131)2 . 

=-1.167328 



a = 2.59340- (-1.167328) (1.77131) 



= 1.165275 
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And, again ^ the log-probability equation is: 

y * 1.165275 - 1.167328x 
Now that we have the log- i/robabll 1 ty equaLlou, what cau we do with itr 
Iteiatlog and Frulectioa of the Eguatiun . 

One major use of the log-- probability equation ia to calculate cuix/t 
values for given time values. This function Includes finding curve 
values for; 

1. The time vaiuea ubed to calculate the equation (aO that co^-ve 
values Can be compared with actual values); 

2. Any time valuea which tali between those uacd to calculate thk 
equation (to estimate retention at intermediate times); and t 

J. Any future time valued (to project uuiubeiS to be retained at 
any future point in time). 

Foi the flrat two tyi>eo of time valuea , thla i>i.oceaa la called lte£atlou > 
For the third, it is called projection . But the mathematical ateps 
involved are the aame in all three casea. For any given time value, 
these steps are: 

1. Convert the , time value to loga. 

2. Substitute the log of the time value Into the log- probabl 11 ty 
equation. Thia substitution involves: 

a. Multiplying the log of the tline value by the value of b, 
and 

b. Adding the reault ot thla multiplication to the value of 
a. 

The result of this substitution la the curve value gl;ven by 
the equation at the given time value. : This curve value is in 
standard deviationa^f rom tt^e mean of a normal curve. 

3. Convert the resulting curve value to* its cqrresponding percent ^ 
remaining value. 
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For example, using the equation which we calculated In the previous 
section and the time-value 1,5 'years, we get: 

y '-^. 165275 - 1.167328 (log 1.5) 

- 1.165275 -'1.167228 (0,17609) . i 

= 0.95972 

This value is in standard deviation units'*. Using Table 4-2 and inter- 
polation, it converts to a percent remaining value of 83.14%. 

To find the curve value at an intermediate point, say 2.75 years, we go 
through the same steps: 

y » 1.165275 - 1,167328 (log 2.75) 
= 1.165275 - 1.167328 (0.43933) 
■= 0.65243 

This value converts to 74.29%. 

To estimate the percent remaining at a future time value, say 5.5 yeats, 
we again do the same things: 

y - 1.165275 - 1.167328 (log 5.5) 
= 1.165275 - 1.167828 (0.74036) 
= 0.30103 (or 61.83%) 

To convert these percent remaining" figures to "number of employees 
left from the original group," divide the "percent remaining" figure by 
100 (to get the percent to its decimal f orm) , .and tfien multiply the 
number in the original group by this quotient.X Suppose, for example, 
that the original group consisted of 250 employees. *Then the estimated 
number of employees remaining five years since the end of the hiring span 
(i.e., 5.5 years using the averaging factor) is^ 

61>83% X 250 = 155 employees 
100. 

(Note - If you use a calculator, a mathematical formula, or a computer 
program to find percent remaining values, the result will be i»,uecimal 
form so that division by 100 will not be necessary.) ..-^ 

A 30-year projection of the sample log-probability equation is shown in 
Figure 5-1. This graph also shows the positions of the four actual 
percent remainijig values. Here, again, is the standard retention curve. 

The points which are plotted ^or years 5 to 30 in Figure 5-1 represent 
the most probable values for projected future retention. They are esti- 
mates. It is also possible to determine a range of values within which 
future ret Alt ion values will most likely fall. We will show how to do 
this in an upcoming" section. 
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Application To Hiring Levels 



In Chapter 2, we applied data read from the nomograph to several prac- 
tical planning situations. We can now use thd log-probablllty equation 
to handle the same situations. To demonstrate this, let's use the log- 
probablllty equation we calculated earlier to represent a given occupa- 
tion: . 
y 

y = 1.165275 - 1.167328x 

Suppose, as we did In Chapter 2, that an organization has been hiring 
exactly 100 employees Into a given occupation, each year for the last 6 
years. Suppose also that this hiring was evenly distributed during a 
year (so that the averaging factor will be 0.5). To get a length-of- 
servlce dlstrlbutloivf/of those employees still on board, first Iterate 
the equation using the time values 1.5, 2.5, . . . 6.5 (representing 
from 1 to 6 years since hire). When we do this, we get: 





Percent 


Year 


Remaining 


1.5 


83.14% 


2.5 


75.83 


3.5 


70.20 


4.5 


65.64 


5.5 


61.83 


6.5 


58.56 



TtaualaLlnft these percent values to numbers retained (with 100 employees 
In each orijjinal group) gives: 

Nximber of Employees 
Years Since Hire Retained 

6 59 
5 62 
^ 66 
3 70 
2 76 
1 _83 

416 

Now we want to determine how many of these 416 employees will leave 
during the upcoming year. By the end of this next year, each employee 
still on board wilTTfflb^ to the next length-of-servlce category. For 

17 In an operating sltuatlotrT^a^ength-of --service distribution can be 
found by simply counting the number In the given occupation who are 
In each length-of-servlce category. 



example, thoae In the one-year-since-hire category will move to the two- 
year s-slnce-hire category. From the log-probability equation, we know 
that 75-83% of the original group are retained to the end of the aecond 
year J So this ^oup will go from 83 to 76 (KIQ. x 0,7583) employee*. 
Each of the other groups are reduced in the same manner. (For the 
smallest group » we need the percent retained value for 7.5 years which 
is 55.72%,) 



This prucesci 


gJlvcD the CoIIowJlug tlgujLca 


tojL t|ic cud o£ 


Llic UexL ycaJL 


Years 


Mtj^ber at. End 


MuiuLet Oue 




biuce UIlc 


of Current Yeai. 


Year Later 


m Ye, 


6 


59 


36 


3 


5 


62 


59 


3 


A 


66 


62 


4 


3 


70 


66 


4 


2 


76 


70 


6 


1 


83 


76 


7 




416 


389 


27 



XLue*, 2/ i>£ lUti A 16 i oyeed will i«iav/c JuAiug the upui>iiiiu5 yieui. . iikiu 

is a tHraover rate of .0649 (z7/Alo) or c.A9%. 

We can use these fl^Xitea to show how dltferent level o of iilrini^ afft^cL 
the tuiTXOver rate of a gi^oup. 

Suppose that during the current year die aame pattern ot hiring coutin 
ues; i.e., 100 employeeo are hired into the occupation in an even dis- 
tribution during the year. Using the log-probability equation we know 
that 83 of these 100 hires will be retained by the end of the upcoming 
year. This mean^ that 17 of them will leave. Combining these losses 
with those from the other length-of-servlce categories gives: 




17 4 27 AA loases 
out of 



100 f 416 « 516 employeea 

M. 

This gives a turnover rate of .0853 (44/516) or 8.53%. 

Now suppose that instead of hiring 100 entployees into the occupation 
this year, the organization decides to hire 200, The niunber remaining 
from this group of hires at the end of the upcoming year would be 166 
(i. e., 200 X '.8314). thus, 34 of the new hires will be lost. Com- 
bining these losses with those from the other length-of -service -ipate- 
gories (which remain the same) gives: 
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34 + 27 « 61 losses 



out of 

200 + 416 = 616 employees 

In this case, the turn^er rate would be .0990 (61/616) or 9.90%. 

Continuing in the same niaimer gives the following turnover percentages 
for the group under different levels of hiring: 



Number Hired 


Nximber . 


Nxnnber of 


Turnover 


Current Year 


Lost 


Employees 


Percent 


0 


27 


416 


6.49% 


_ 50, 


35 


466 


-7.51 


100 


44 


516 


8.53 


200 


61 


616 


9.90 


300 


78 


,716 


10.89 



These data illustrate again a point which was discussed in Chapter *2 but 
which is of^uch importance that it will be stressed again here.^And 
that is that the rise and fall of turnover rates is in response to 
the rise and fall of hiring rates . 

Thus, when a management decision is made to increase hiring, the Impact 
of the decision on turnover can be determined. By the same token, if a 
decision is made to freeze hiring or even to have a reduction-in-f orce, 
the subsequent decline in turnover can be measured. ^ 

Other Applications 

The log-probability ecjuation can be applied to other personnel question^'. 
A few of these applications will be discussed here. We will continue to 
use the same sample log-probability equation. 

1. What is'the probability that a new hire into an occupation 
will still be on board x years from now? 

This question can be answered simply by iterating the log-probability 
equation for the value of x. For example, if we want to know the proba- 
bility that a new hire into our sample group will last 6 years, we iter- 
ate the log-probability equation for 6 years: 



y = 1.165275 
= 1.165275 
«= 0.25692 



1.167328 (log'6) 
1.167328 (0.7.7815) 



This standard deviation value translates to 60.14% or a probability 
value of 0.6014. So that there is a 60% chance that a new hire into 
this occupation will stay for 6 years. ' r 

2. What is the probability that an employee who has already been 
on bpard for x years will still be on board in t years? 

To answer this question, we need two values; 

(a) the probability that an employee will stay for jt yeaid, and 

(b) the probability that an employee will stay for x t years. 

Then the probability that an employee with x years of service will stay 
for t more years is calculated by dividing (b) by (a) . 

For example^ If we want to know the probability that an employee who has 
3 years of service will stay for 3 more years, we first iterate the log- 
probability equation for the values 3 and 6 (3+3). This gives: 

y - 1.165275 - 1-167328 (log 3) 
« 1.165275 - 1.167328 (0.47712) 

- 0.60832 

y « 1.165275 - 1.167328 (log 6) 
« 1.165275 - 1.167328 00.77815) 

- 0.25692 

The first standard deviation value (0-60832) transforms to a probability 
of 0.7285. The second (0.25692) transforms to 0.6014. So that the 
probability that an employee will stay 3 years is 0-7285 and the proba- 
bility that an employee will stay 6 years is 0.6014. 

Now the probability that an employee with 3 years of service will stay 
for 6 years is: 

Probability of staying 6 years ' 
Probability of staying 3 years 

Or, in this case: , ./ ' 

0.6014 - 0.8255 
' 0.7285 

Thus, in this group, an employee with 3 years of service has an 83% 
chance of remaining 3 more years. Or, in other words, if 100 employees 
in the given occupation have 3 years of setyice, then 83 of them will 
stay for at leaQt 3 more years. , 



3. How many employees must be hired now^ so that x of them will be 
on board in t yea*rs? 

To answer this question, we simply Iterate the log-piobabil Ity equation 
for t + s years (where s = an averaging factor, if any) and divide the 
number of employees wanted In t years by the resulting probability. 

For exwuplc. it we want to have 100 ei4luyeea in our sample occupation 
on board in 3 years, how many should be hired this year (supposing that 
these hires are made evenly throughout the year)? First, we must Iter- 
ate the log-probablllty equation for 3.5 years (assuming an averaging 
factor of 0. 5) : 

y - l.lb!)2/5 l.lo/32tt (log 3.5) 
= 1.165275 - 1.167328 (0.54407) 
- 0.53017 

tUia uiaudtoiiua Lo d ptubablllty vulue ot U,/UZU. To gee ihc a«iuli.ed 
answ..r, we divide lOO (the number of eiupluyees wanted on boar J in 3 
years) by 0.7020. This gives: 

100 •*• 142-templ«jyceu ( 
.7020 \. 



Thu«. 14/ eM^^loyecB muat be lured lu the occupaLlou Lhla year that 
100 ot them will be on board in 3 years. 

A. Aasumlng that a training course costs $1000 ^^er employee, what 
Is the actual cost per employee trained and still on board in 
t years when (a) only new hires arq trained and (b) only 
employees with x years of service are trained? 

SuHpobc that a decision Is made to train 100 employees In the aample 
occupation. And suppose we want to know the actual cost per employee ' 
trained and atill on board three years after the training program. - 

In case (a), all of the employees trained would be new outside hires. 
Three years after the training program those employees from this group 
who are still on board will have an average length of service of 3.5 
years (assuming an even distribution of hiring during a year). 

The first step is to calculate the probability that -these employees will 
be retaine<^ in three years. To do this,, we use the method for question 
1; i.e., iterate the log-prob,ability equation for the time ^lue desired. 
In this case, the time value is 3.5 years. This gives: 

y"= 1.165275 - 1.167328 (log 3.5) 
= 1.165275 - 1.167328 (0.54407) 
= 0.53017 
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This translates to a retention probability of 0.7020, 



To determine the actual cost of the training program pet uew hire still 
on board in 3 years, divide the billed cost of the training ($1000 per 
employee) by tHe 3-year retention probability (0.702.6). This gives: 

/ ^ 
$1000 - i$1425/employee traljcied / 

0.7020 

Thus, Lhe 3 -year return on the tralulug iAvebtuieuL wlieu ouiy ucw ouLblde 
hires are trained is $1425 per employee trained and stiil on boaid. 

For case (b) , suppose that all the employees to be Ltalued already have 
three years of service. Three years after the training program those 
employees from this group who are still on board will have an average 
length of service of 6.5 yeara. 

In this case, we aeed to calculate the probability that c^iuploycea wltli J 
years of service will reach 6 years of service. To do this, we uae the 
method of question 2. Using this method gives: 

^ y - 1.161)275 - 1.16732b (lofe 3.:)) 

« 1.165275 - 1.167328 (0.54407) 
- 0.53017 (or 0.7020) 



y - 1.165273 - 1.16732b (log 0.:>) 
« 1.165275 - 1.167328 (0.81291) 
- 0.21634 (or 0.5856) 



I'he reteiitloii probability tor thla group is; 

0,5856 - 0.b342 
0.7020 



Now we can again divide the billed cobt or the ti.aluiug ptograiu ($1000 
per trainee) by the calculated retention probability (0.8342>. This 
gives: 

$1000 - $1199/employefe trained 
0.8342 

Thus» the 3-year return on the training investment for this group is 
$1199 per employee trained and still on board. 

The important thing' to notice here is the per employee cosV difference i 
that occurs solely as the result of a selection decision on who is to bi 
trained. In the long rxin, it will cost an organization more money to 
train new hires than to train employees who have been on board for a 
while.- This i^ a factor which should be taken into account when train- 
ing decisions are made. 
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The same type of analysis can be made for any other personnel arei. 
involving the selection of employees, Thl^ includes decisions on j 
filling vacancies by hiring from the outside or by reassignm'fent fr|m 
within. The outside hire has a greater probability of leaving in leab 
time than 4oes a reassigned employee. Again, this Is something a man- 
ager should knoy before such decisions are^made. The manager may still 
decide. to fill vacancies by outside hiring but the higher turnover .level 
that follows should not be a surprise. 

More deLalicil de«cripcious ot chc applicaLion^ di«cuoaca lu Lhlb c^ecLloi^ 
plus other applications can be found in the Appendix secLiou., 

Tlie ttppllcdLloutt aliowu in Lliitt ttecLiv>u arc examples ot aome ot l life Kiudto 
of analysis which personnel ofticials t^hould perform to assess a manage- 
ment workforce plan. They can be used to answer such question^ as: 



X o 



jottttltle Lo provide Llie woiklorcc reqiiired by Llie piaaV 



an workelo needed be prox/lded,Ou tl^l tUtue acheUuie wiilch 

ha-^ been established? 

How mucli will providing Llie required woikloicc cofc^LV 

if auAl^t^la Shows LhaL a wotkforce plan la in bome way iufetittiLlc, uheu 
mauagen^nt should be (a) informed on why the plan is not f easibl e ' aild 
(b) advised on possible adjustments that could be made to it. 

Confidence Kan^e for ?ro]fccLed Values 

lu a previous section, w^e showed how to uae Lhe lo^ piobabllicy equatiou 
to obtain estimates of percent retaiuod at future times. Such projec- 
tions give the most probable values of future retention. ^ reality, 
the victual retentioa values will most likely fall within a fange of 
values surrounding each projected point. ^l|:hl8 range c&n be found by 
using an important property of the standard error; i.e., that 95% of Lhe 
deviations from a least squares line will fall between -2s and +2s. 

This property makes it possible to calculate a range of values into 
which an actual retention^ figure will fall 95% of the time. This range 
is known as the 95%- or confidence range . 

Once a log-probability equa:tion has been calculated, the major steps in 
calculating the 95%-range are: 

1. Iterate the log-probability equation for the known points and 
find the deviations between the actual values and those given 
by the log-probability line. 

2. Compute the standard error. 

3. Project the log-probability equation for the values desired, 
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4. 



For each projected value, calculate the upper and lower 
limits of the range in standard deviation units (using + 2s as 
boundaries) • 



5. Convert the projected values and th^ range limits to their 
corresponding percent remaining values. 

6. Convert these percent values to numbers of employees. 

To illustrate this process, we fit a least squares line to the following 
retention data: 

Years Since Percent 

Hire Remaining \ 

1 58% 

2 44% 

3 36% 

4 30% 

U8;l4tg">^ averaging factor of 0.5 years and converting the time values to 
togs anoM^he percent values to number of standard deviations from the 
mean of thenormal curve gives: 

Log Stan. Devs. * 

Year From Mean 



0.17609 0.20189 

0.39794 -0.15097 

0.54407 -0.35846 

0.65321 -0.52440 

The log-probability equation that results from fitting these data is: 

y « 0.4^38 - 1.5171X 

New we are ready to begin the 6-step process of calculating the confi- 
dence ranges associated with the projection of this log-probability 
equation. 

Step 1 . The deviations from the least squares line represented by the 
log-probability equation are calculated by: 

a. Iterating the equation for each*^ given x-value to get the value 
given by the line (y^ ). 

b. Calculating y-y*^ for each given x-value. 
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These calculatio^fe are performed with the y and yl values in units of 
standard deviations from the mean of the hiScmal curve; i.e., without 
transforming them back to percent remaining values. 

Going through this process for the first x-value (1.5 years) gives: 

= 0.4638 - 1.5171 (log 1.5) 
=■ 0.46:^8 - 1.5171 (0.17609) 
. ■ - 0.19667 

The actual y-value at this point was 0.20189. Thus, the difference 
between the line and the actual data at x - 1.5 is: 

y - yl = 0.20189 - 0.19667 - 0.00522 

Each of the other deviations is calculated in the same manner. The 
process is summarized in the following table (which also includes the 
square of the differences) : 



X 


Z 




y_yl = d 




0.17609 
0.39794 
0.54407 
0.65321 


0.20189 
-0.15097 
-0.35846 
-0.52440 


0.19667 
-0.13992 
-0.36161 
-Q. 52719 


0.00522 
-0.01105 
0.00315 
0.00279 


0.000027 
0.000122 
0.000010 
0.000008 
0.000167 



Step 2 . The standard error about this line is: 

« "JmT ./ 0.000167~ 

. >N-i y 3 

= Vo. 000056 = 0.^07^8^) 



Ste2_3. Suppose that we wish to project the equation through 10 years 
since hire (using x-values of 5.5 to 10.5). .This gives: 



Years Since 

Hire 


5 
6 
7 

8- 
. 9 
10 



Projected 
Value (yl) 

-0.65940 
-0.76947 
-0.86375 
70.94622 
-1.01950 
-1.08545 



Step. 4 . To calculate the upper limit (U) and the lower limit (L) of a 
confidence range, we use t/tie formulas: 



2s 
2s 



Where represents a projected value. ^ 
In this case, 2s - 2 x 0.007A8 - 0,01A96. 

For the first projected value (y^ -0,65940), the upper and lower 
limits would be: 



U ■= + 28 
= -0.65940 
= -0.64444 



0^ 



01496 



And, , 

■f 

L yl -28! , 

= -0.6594b - 0.01496 
-0.67436 

Doing this for each projected value gives: 



Te^rs Since 
' ^Hire 

5 , 

6 

7 

8 

9 

10 



Upper 
Limit 

-0.64444 
-*0. 75451 
-0.84879 
-0.-93126 
-1.00<^54 
-1.07049 



-Hs^ge (in ^tan.' Devs.)- 



ected 




Lower 
Limit 

-0.67436 
-0.78443 
-0.87871 
96118 
3446 
-1. 10041 



Step 5 . So far, we have been working with standard deviation values. 
Now we can convert each of the standard deviation values to percent 
(using Table 4-2 and interpolation). This gives: 



■ • > 
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\ 



-Range (In Percent)- 



Years Since Upper Projected Lower 
^^T^^ " ^ LiMt X^V Value L^it 

. 5 25.^ > 25.'48Z ^»25.00Z 

t . . ... 22.53% -22.08Z ■21.64Z 



I,: 19.801 19.39Z 18.98Z 

? . , 17.59Z 17.20Z 16.82Z 

,1 . . ■ . .15.76Z 15.40r ^ ' .15.05Z 

- . 14.22Z. - 13.89Z it 13.56% . 



4 \ 

Thus, for ewmple, there is a 95Z chance that the percent of the origi- 
nal group retained at-the end of the fifth year since lilre will 
between 25.00 and 25.96. In other words, 95 times out of 100 the actual 
retentio^ percent will fall between these two values. 

A graph of this sample retention curve showing the confidence ranges for 
the projected values Is found In Figurfe 5-2 ."^ ^ 

Ste£_6* To convert these percent figures to/nuiid>ers of employees to be 
retained, simply multiply the number In the original group by , each of 
the percents (In their decimal form). Suppose, for this exaii?>le, thaf 
there were 250 employees In the starting group. Then the range values 
would be: ' > .i^, 



-Range (in tlumbets)^^^- 



Years Since 
Hire 




Upper # • 
Limit 


Projected 
Valuie 


5 




• * 

64.9 


^ 63.7 - 


6 


t 


56.3 " 


55.2 






49.5 


48.5 




44.0 


•'43.0 


9 




^ 39.4 


38.5 


10 




• 35.6 


34.7 



Lower 
Limit 

62.5 
.,54.b 
\47.5 
42.1 
37.6 
33.9 



These figures may be ^^^^^ planning purposes. ^ ^ 

Each of the functions 1ffiW3lved In fitting and projecting a retention 
curve that has been discussed In this chapter can be done by an already 
developed computer, program. This program ^ia khe subject of the ^ext. 
chapteri , - - ^ ^ 



CHAPTER 6 

STAFFING NEEDS PLANNING COMPUTER PROGRAM: 



LOGPRO 



Analysis Functions 

i . % ■ 

The computer prhgic^jto LOGPRO has been designed to automatically perform 
the basic retentloi^^ajialysls functions described In the preceding chap- 
ter. These functloif^ are: 

. 1. The calculation of the a and b values of a log-probablllty 
equation using: 

a. Operator-supplied retention dat^ and 

b. Least squares 



Lied retention datA and 
tefchnl^ues; \ 

le log-probahlHty ©qnatlon for 




2. The Iteration of the log-probablHty ©^nation for each oper- 
ator-supplied x-'value; ^ y 

/'■ 

3, ^ The cotnputation of the standard, error about the log-prebablllty 

line: / ' ^ ^ 

A. The projection of the log-probablllty equation for an operators- 
supplied projection period; and 

5. The calculation of the 95% confidence range for each projected 
value* ^ 

' , ■ « ■-.<...'■■ 

LOGPRO (and each of the other staffing needs planning computer programs 
discussed in this ^andbopk) forms a self-contaihedy^comprehensive -package 
consisting of (A)- the rn^in program ^(LOGPRO) \^lch performs the' retention 
analysis and '(b) l-he subprograms (ANDPX and ANDXP) which handle the ' 
ansformatlons from "percent retained" to "number of standard devla- 
s from the mean of the normal curve" and back again. A user of this 
^t of programs need not ha^e an extensive /statistical background In 
Tder\to. successfully utilize* and evaluate Its results. 

.0 is presently set up to be used in a time-sharing environment with 
sex supplyliig retention data from a demote terminal during the run 
e'program. In other words/ th^ fSrogram will a^k the user for 
irfrq^j|g^tj|.on and the user will provide it. 

r\m of LOGPRO consistjs .of two main anj^ysls. sequences. They atje: 



Figure 6*1 



— ■ . > ■ : 

PLEASE ENTER THE NUMBER OF YEARS FOR WHltH RETENTION DATA ARE AVAILABLE 



PLEASE enter' THE NUMBER IN /HE STARTING GROUP 

( 



2S0 



WILL INPUT DATA BF IN 

( 1 ) NUMBER OR 

(2) PERCENT 
(/\NS ! OR P) 



PLE/iSE .ENTE^' THE VO'l-E'^OF X.(TT^n AND Y ("UWpf'R 0^ PF'CrvT 

RETAINED ^'^ f '^fW, , Mf*V'!r''" Al ' ? ' t r^ i MJI 



i. 


s, 








.^6 



yvyyvyyyvyxyyyyyyyvyvyvvyyyyyyvyyvyyvyyyX'XXyyVvyyxVyyv y Vyy v yvvyvvyyy 

y 

TABLE or L0G-PP0RA3IL I Jy AN/VLVSIS RESULTS 



--ACTUAL DATA— --L - P CURVES- 
YEAR NUMBER PERCENT NUMBER PERCENT 



1.50 1^5. 58.00 57.80 

2.50 no. ^^.00 HI'. 09 A kk.kU 

3.50 90. . 36.00 ^ 89. 71 35. 88 

^.^0 75. 30.00 •^7^.76 29.90 



THE NUMBER IN THE STARTING GROUP WAS: 2S0 

TH£ LOG-PROBABILITY EQUATION IS 
> , Y = a.^6382 + ( - 1 . 51 70 7) X 

. THE STANDARD DEVIATION OF FIT IS 0.007^69 

xxxxxxxxyxxxxxvxxxxyxxxyxxxxxxxxxxxxxxxxxxxxxxxixxxxxxxX'Xxxxxxx.j^xxTxyxxx 

— ^ — — ' — ^ ^ ^ — p 



1. The iQg-probability analysis of the given retention data; and 

2. The projection of the calculated log-ptobabilit^ equation. 
Log-Probability Analysis ^ 

During this sequence, LOGPRO performs all the calculations required.to 
fit a log-probability line to longitudinal retention, data (including 
functions 1-3 above) . . 

The information fed to the computer by the izr^rator .for this phase of 
the pr,Dgram consists of: ^ . 

K The number of years for which retention data are avs-llaMP: 
i.e., the number of known retention points. 

The number of employees in the original or starting group ^o' 
the occupation to be analyzed. 

Whether the oollected retention data *^re in the form 'nuipb'^^ 

/♦. Thp acfiial rprpnrion data. 

An example of the question and answer sequence for this part of LOGPRO 
is shown in Figjjre 6-K Responses made by the operator are underscored. 
After each reque>tKfor information, the computer pauses and waits for 
the operator to respond to the request. Each user response^ is followed 
by a carriage return \ This sends control of the program back to the 
computer. 

The data used in the sample run are: ^ 

Years' Since Percent 
Hire Retained 

1 ■ ., 58% - 

2 44% 
- 3 36% 

4 » .30% 

Number in the original group = 250 • 
Averaging factor = 0.5 years ■ 

Since there are four pairs of retention points, the answer to the first 
request for information is simply 4, And, since there wete 2,50 employees 
in the starting group, the answer to the second request is 250. 



The next question requests that the operator make a choice based on the 
form of the retention data. If da^ta have been collected in "number of 
employees retained" (i.e. , without conversion to percent) then alter- 
native (1) is desired and the operator response is K In this. case, the 
program will, automatically convert these values to percent retained. 

If the data have been manually converted to percent retained, then 
alternative (2) is desired. And the operator response is 2^. Since this 
is the form of our sample data, the response in Figure 6-1 is 2^. 

Next comes the^entry of the actual retention data. One important point 
to iremember for correct entry^"^ of these datg ^Is that all decimal points 
must be shown . This rule applies to evety';,piece o^" data entered at this 
time including whole nt(p\bers. For px^inpl^^j; v^^r 1« cmterpd op 1,. 100 
emptoyees ie entered p« 106 .^ 

In. addition, when the "^ta pre in ^^*^cenf, ^he decimal form of the pe* 
cent value Is ent**''*"^ oSf^mr^^l ^^"^ p^r^r^rc*^ op ,581- 

p.nfp^ ed f^R .58^2- 

Tori, »of#-»>ff'.»» pflfr fo pnf'C'r^rl on n sl'^pl** Mn*^ «Ko <.-•»>«• 

Entry continues until the s-upply retention pa-lrs is exhausted. 
LOGPHO takes this retention data and converts it to log-probal^Llity 
form. Then the linear le-a^t sq^Jares technique 1« used to determine the 
a and.b values of the lo£-pro>ibiUty equation.- This e^quation is Iterated 
for each given time value and the standard error is computed. The 
results of all these calculations are then printed out in the ''Table of 
LoR-Probability Analysis Reetults" (Figure 6-1). 

"... 

As you can see, this table gives you, in a convenient form, both the 
inputted retention values ("ACTUAL DATA") and the results of the log- - 
probability analysis ("L-F CURVE") so that they can be readily compared. 
In addition^ the computer prints out the number ^n the starting group, 
the a and b values of the log-probability equation and the standard 
error of the log-probability fit (in standard deviation uhits) . 

Projection Analyajg ^ 

During this sequence, LOGPRO performs all the calculations required to 
project a log-probability equation and calculate the 95% confidence 
range for each projected value. 

The information given to the computer during this phase of LOGPRO con- 
sists of: ■*<fK 

1. Whether the operator wishes to p'fbject tne log-probability 
equation (a yes or no decision); and j 
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2. The time parameters of the projection. 



An example of the question and answer sequence for this phase of LOGPRO 
is found in Figure 6-2. This sample run projects the log-probability 
equation calculated in Figure 6-1. 

At the beginning of the projection section, the operator may choose 
whether or not to project- the calculated log-probability equation. If 
not, the^ computer will skip around this sequence. If yes, the computer 
will ask for certain information concerning the time frame for which the 
projection is wanted. At J^^pg; point, the operator must enter three 
values. They are: ]:^'^^;^^r 



1. 



The first (or TnlnTTnuw) t-fme valtie for v^icb n pm-jectlon 
wanted. ^7 




The last (or m^v^iDiiTn) ^Imp value for wMij^h t^ proleot^on 
wanted. t 

L 

^ The le'^gth of rhe time interval v»otw«*oT> riiTrr«*oc j ^^r* yo ^o< » ?^>r^ 

(>ofntR (or tbp tlm^p 1 nor fjthon f ) . 



r» »\ <^ VT> 



These ^^aluee *^re entered sid<»-by-sid« , separated by commae 
Figvirp ft ?) . Hprp pgp-fn a_ll dpr1w«1 polnfo mnpf hp f2>iown 

The sample run in Figure 6-^2 projects the log probability <?quat-Ion for 
the timp values 5.5 years to 10.5 years. The increment in this c^se Ip- 
1 year. This means that projection points will be calonlf»te<l for e«cb 
of tbe^e values: 5.5/6.5. l.si 8-5, 9,5. and 10.5* 

LOGPRO now takes tliese time values and projects the log-probability 
equa^on for the given time frame. It also calculates the 95% confi- 
dence ranges for each projected value. These ranges are calculated in 
both projected numbers of emjJloyees and projected percent * retained* The 
results of all of these calculations are printed out in the "Table of 
Projected Values." This table contains, for each time value, the pro- 
jection (EXPECTED VA^^UE) and its 95% range for both projected numbers 
and projected percents. 

After printing out the projection table, the ciomputer will ask the oper- 
ator if there are any :more dat% to be analyzed. If there are, the computer 
will recycle to the beginning of LOGPRO; If not, the computer will stop 
the execution of the program. 



Summary 



LOGPRO and its subprograms form a complete statistical package for the 
analysis of retention data which are: 



Figure '6-2 
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1. Longitudinal - I.e. / employee groups (ar cohort8> are followed 
over time and counts are made of how many employees remain on 
board after certain lengths of time. 

2. Occupational - I.e., data are, collected by occiipatlon In all 
cases. Data are also entered by occupation except In those 
cases where aggregation of occupations is possible (see 

^ Chapter 8). 



LOGPRO uses the statistical techniques described in Chapters 4 and 5 to 
perform its analyses. A more detailed discussion of the mathematics 
used by LOGPRO can be found In its Technical Analysis which can be found 
in the Appendix section along w^th its Operation Manual agg Program Llj^tlng . 

Although LOGPRO is an^extremely useful tool for ret^tlon mialysis, 
there are apu^ situations where it Is Infeaslble to use it directly, 
Bucb as In organizations ^th: ' 



Hiring patterns which v«ry widply frnm y^^r to v^*»> » 
Low levels of hiring^ 

Mflny occupations having only «i few employees. 

The techniques used to handle these t>n^ f^imi^Br 9ltt«»tionB will ho 
explAin^d in tbo n«xt two chi>rter«?. 
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CHAPTER 7 



STAFFING NEEDS PLANNING COMPUTER PROGRitfl; -"^ ^ 



LPFILE 

4*' 



So far we have been discussing techniques of retention analysis which 
require the ii^e of longitudinal data. However, there are cases where 
either (a) longitudinal data are not available or easily obtainable or 

-(b) organizational blring patterns do not readily adapt to a LOGP^ type 
of analysis. We now need a way to apply the log-probability analysis 
technology to such cases. For this purpose' we Have developed a second 

^retention analysis computer progrmn. This second program is known 
aac LPFILE. ^ , 

i. 

When To Use LPFILE ^ 

There are ceik^l^^ituatlons where longitudinal retention analysis 
becomes Ifess precise and, thus, where the use of LPFILE Is recoimtiended . 
Th^ three most common of these situations wfll be dlstussed. 

^. 

1- The number of employees hired into one or more of an o^anl- 
zatlon*s occupations during any one given year Is relfl:tlvely 
small. ' 

This type of situation can occur both in small organizations with few 
employees knd in large organizations consisting of a scattering of many 
occupation^ with few employees.. In such cases, the use of longitudinal 
data is limited because a low level of hiring leads to toall cohort 
groups. And. small cohort groups, like small statisticaj. samples, are 
more subject to error|than are large groups. Also, a sma^l cohort is 
less likely to have a truly normal distribution of work attitudes (since 
the likelihood of the exlst«nce of a normal curve Increases as the 
number of work attitudes Increases). In addition, for a small group, 
the relationships between standard deviation values and percentage of . 
cases change cpnsiderably. 

2.\ The yearly pattern of hiring into an occupation changes from 
one year to the next. 

One example of this type of slttiation occur s'^fflXga '^l^g into a given 
occupation is concentrated early in one year aMXate in another. Such 
yearly differences In hiring -p^^temB will cause'Jearly changes in. the 
time-averaging factor which Is usei to caldul&te^the log-probablli,ty 
curve ,(e.g., 0.3 In dne year and 0.7 Iji the next),. This" means that a 
new'dquatlOn must be computed for each year's group of hires which in 
turn means continually collecting and recording longitudinal data for 
each group. Thus, suA a situation makes it difficult -to apply a log--, i 
probability equation computed from a group of employees hired in one 
gflven year to employees hired in any other year. * 
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For example. Figure 7-1 plots a log'-probability equation computed using 
an averaging factor of 0.3 years against longitudinal data for a group 
of hires for which this actual averaging factor was 0.7 years. As you 
can see, any type of projection or calculation using this equation for 
this particular group of employees would be consistently low. 

In addition, under these^ conditions, the use of a single log-probability 
equation to perform the analysis applications^ d-iscussed in Chapter 5 
becomes Imprecise, This is because the applications were designed to be 
usiBd on all employees irf a given occupation regardless of their year of . 
hire. 

As a corollary to* this situation, it might be that hiring into an occu- 
pation during a given year is so unevenly distributed as to make it 
difficult to use a convenient time-averaging factor. In such cases th« 
only recourse would be to compute an actual average-length-of-service 
figure using bhe entry-on--duty datP of every employee In the cohort. 
This may be too cumHereome a tusk, particularly if the rp^RMlt^nt *^^ffr 

One vear's log^pro>>ability rVtent^on curv- for a given occvpa- 

tiOn is noticeably f^ffforonf- from ^^><>t'>>or vogr'p rx^rvr^ f ♦ 
ppTTip or^rnpfl 1 1 on _ 

It. may be fhat obsprvation of diffei^ent years' cohort groups fon an 
occupation reveals noticeable (although not-Nstatisticall / sismlf leant > 
differences Mi the retention curve for the occupation. This usually 
happens wh^n the number of employees hired in the individual years is 
relatively small, so that random variation differenc<>s tend to he rela- 
tively large. What is needed in such cases Is a^way to flvorage tbp . 
curye dif f erences-- wh^ ch Is what LPFTLE wilt do/ 

The LPFILE -Method 

The basic motivating factor behind the. development of LPFILE was the 
need for a method which would handle each of the above situations by 
making the maximum possible use of every bit of Information available. 
Data on every employee hired into an occupation during a selected time 
^span ,(say, 5 years) are inputted to LPFILE. These dat& are used to 
compute a log-probability equation fot- the occtipatiW. The system used 
has several advantages: / 

Since data on every employee are utilized, every available 
contributor to the group retention trend is used, to compute 
the group's equation. ^ * 

- The si\^e of the s^ple used to calculate the equation is 

nlaximized. (This is particularly imp<^rtant when hiring, levels 
' are low, ) 
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Use of all the years of data available gives the maxintum 
^ • possible precision to the result. 

The employees used to compute the equation are not limited to' 
--^ one year'p hir«s. 

There is no need to continually collect and record longitu- 
dinal data# ' > 

LPFILE uses a* previously-stored file of ^ployee data to coiinpute a log-, 
probability retention equation. In this section we will discuss first 
the mathematics used by XPii'ILE, Then the &et-up of the data file, and 
lastly examples for bpecifit sets of d&ta. 

The Mathematics : Suppose that we knew for the past, say, five years (a) 
the date when every employee was hired into a given occupation and (b) 
the date when those employees who left the occupation dJLd. so. With this 
knowledge \Je could develop a quasi-longitudinal retention' curve for the 
occupation to which we could apply the log-probablllty analysis Technique. 

We know from the previous discussions of log-probability analysis that 
two items of data are needed for use of the linear least squares techaique : 
(i) the percent retained ^rom a given group of employees ^t (2) specnTc ^ 
times. For LPFILE we need ^to collect dur data so that we^ can construct 
replicas of these data "items. To do this we tirst need to select a time 
per'iod for which data collection will take place. This could be the 
past two yejj^rs, three yeSrsSfcr whatever time span fo^ which your organ- 
lization ke^s chronological records of GH>loyees hired. * ^ ^ 

Setond, we record,^ for every employee. hired Into, a given occupation . 
during t,hat time period , one or two dates: 

■4 

1. The date of hire into the occupation (for every employee); 
and 

^ 2. The date of separation from^the occupation (for those employees 
? who - have separatediA during the time span). ' ^ 

These dates become part of the data fil^e wHlch is jus ed rby. LPFILE. 

We need one other date for tlj^ use o* the LPFILE method apid^ i£ is known 
as the "file ending date." ,'^'i^ ie^ the closing" date of 'the ^ile and of 
the time period under study. 

T7 Note that, as discussed at'the beginning of Chapter 4, a'-'i^^ar^t 1911" 
.is any personnel actju» which 'c^^ates a vacancy in the^group being' 
analyzed./ Thus, a moveilient from this ?o some other occupation within 
. - the ^ame organizat^^on is a '^'separation." * v 

. v. ■ . , . .^^ . - 
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LPFlLE uses these dates to calculate two numbers for every employee in 
the group. The values of these numberte will depend on whether or not an 
employee has separated from the, group. 

If employee has separated, ^ the numbers computed are: ^ 

* ' • . * 

,1. The employee's actual time on board , in the occupation (i.e., 
I Date of Separation - Date of Hire) ; and 

2. The length of time that the employee could , possibly have been 
. on bpard during the selected time span if^he of she had notr 
sepfitrated-before the end of that time (i.e.. File Jlndihg 
' Date - Date of Hire). 

\t an employee has not separated, his' or her actual time on board will 
be equal to his or her possible time on hoard. So, for these employees 
the two cjj^mputed values are equal. 

LPFILE recc^rds each of the computed 'valued in one of two* arrays. The 
first array consists of the values of the actual time on board for every 
employee in t^^ group. The second consists of the values of the elapsed 
time between every employee's date of hire and the file ending date. 
(These last values are either "possible time on bpard" or "actual time 
on board" depending on whether 'or not an employed has separated,) 

These computed values are then used to develop a distribution comparing 
the ntmber of employees iij the group who could possibly have served for 
a given. length of time with the number of employees in the group who 
afctually did serve for at least that same length of time. 

* 

LPFILE now uses the results of its computations to set up the k, y .pairs 
for log-probability retention analysis. First, each different 'value 
from, the elapsed time array biecomes a length of time from original hire 
for those employees who could possibly have served, (or who actually did 
serve) for. that, length of time... These values then become the time — or 
X — values fot log-probability analysis. 

The percent retained (or y) values are calculated us^ng the following 
data substitutidns for each x - value: * ^ ^ 

1^^ number .of employees retained, at time x =-the niimber of » 

employees vhq actually stayed with the^group for at leaiBftime 
X (including those employeerf who separated at. soinB time latejr 
than x) . ' ' \ , 

The number -of employees in the original gajoup of hires «= the 
number of employees who could, have stayed/with the group for 
at l,east time x (including both employees who left before time, 
X and einployee^ who stayed longer than time x) . 
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LPFILE DATA COLLECT'iOPJ FOkM 
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V F6r each x > valuer, data itc^myffl Is divided, by d^ta Item 02 to obtain an 
« est'lmate of the percent; ret^^ined at' that time. And these become the 
values-., ■ • 

''^ LPPILE uses these x, y pairs and tH?'"deta transformations' found In. 
L06PR0 to calculate a least squares log-^probablllty equation for the 
glyeii group Th6 resulting equation can be\used In the same way as an 
equation derived by LOGPRO. 

The Data File t To perform its calculations LPFIJjE needs to be supplied 
vlth a specially set up file of employee data, this file consists of 
one record for each employee In a group (or groups see discussion of ^ 
possible file combinations l)elow) • There are four data elements In each 
record. They are the employee's: 1 

1. Occupation or series code; 

2. " Grade at lilre; ^ { 

"3. Date of hire (month and year); and 

4. Date of separation (month and year). If any. 

The last two Items are, of course, those we discussed In connection with 
the method used by LPFILE. The first two Items permit the user to both 
(a) Include more than one occupation In a dingle file and (b) perform 
log-probablllty an'ialysls on employees In different entry*-grade levels. 

In most organizations, tljese data Items can typically be found In a 
chronological file of personnel actions In the personnel office. Or ' A 
they may be recorded In an organization's automated infot&atlon^ system. 

Before data collection Is b^gun, ^decisions must be made on (a) the time 
span fdr wfilch j^ata will be collected and (b) the occupations which will 
be studied I Thds Information Is then passed on to a computer prognammer 
(If the data system^^^^e^utomated) or to the person who will collect the 
data manually. y-^ - 

If data are to be colieidted malnually. It Is helpful tb use some tjrpe of 
a standard collection form such as the one In Figure 7-2. You will notice 
that an extra data Item has been Included In this form. This Is an 
employee Identification num1::)er (e.g.. Social Security Account Number). 
This Item Is needed since the other four data elements will most likely 
be found on separate sheets of ^ap^r. However, LPFILE does not need It 
to do Its work. A partial sample of a filled out collection form can be 
found In Figure 7-3 (A). p \ f 

To get t;^ "data from tihe collection form into a data file In the computer 
so that LPFH;E can use It, a few rules must be followed: 



- 93 



• FiJBure 7-3 ? 
A, . LPPI.tE O'ATA CGLLtCJION FUf^M 
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CC^'pmTcR DATA FILE 

002010501 731 1 74 ^ 
00201051 1 72COOO 
00201050772C1 75 
002C 1 O50971O375 
0^2C105C773000b 
002010509730575 
-00201 050^*720000 
002C105C1 72C275 
0020l05ft,57*40000 
OO2OIO5I 1 730674 
FILF ENCING DATF - JUNE 1975 



LPF ILE Tl/iE VALUES ^ ' ' . 





COULD HAVE 


ACTUALLY 


FMPL07EF 


SERVED 


SERVED 


NliMSFf^ 


_(.YEARS)__ 


(YEARS) 


1 


2.42 


K83 


? 


2.58 


2.58 


3 


2.92 


2.50 


; 


3.75 


3.50 


5 


1 .92. 


1 .92 


f> 


1 . 75 


1.67 


7 


3-17 


3.17 
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■ 3.42 


3,08 
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1 .08 - 


1 .08 


1 0 


1 .SB 


0.58 



- 94 



fa 



. * Each line on £he collection form becomes a line of the data ; 

2. [ The data elettients. In each line are placed In a specific order j 
I Vith occupation code first, then grade at hlre^ month of hireV 

year of hire, month of separation, and year of separation. \ 

'3. If an employee has not separated then the entries for })oth the, 
month and year of separation are zero. 

4. Depending* on the available computer system, thel^ata elements 
- . in each line are either (a) separated by commas or (b) spaced 

^ according to the criteria set up by AFFILE. 

For rule 4b, the^ following spaces are allotted, ^or each data eleiQent: 

- Occupation code — ^5 spaces. (To handle series codes of up to 5 
digits). . , ' ' 

- Grade at hire — 2 spaces. , >y 

J - Month and year o/ hli/e, month and year of separation — 
/ 2 spaces each. / 

/ ^ ' i 

/ If a data element refqulres le^s than the allotted space, then the test 
of the area designated for that element is f llle^ vith leading zeroes or 
blanks. " 

As suggested in rule 4, above, thei;e are Xwo wkys in which the data from 
an LPFILP collection form may ^e transforiitied into a computer-acceptable 
file. First, if the diata elements in^a line are to be separated by 
commas, then the first two rows from the sample collection fiorm in 
Figure 7-3 j[A) would translate to: --^^ 

" ^ 20i; 5, 1, 73, 11, 74, ^ , , 

20i; 5, 11, 72, 9, 0 

N6te that, since the Second employee has not separated, the last two 
vvalues in the second line are zero. 

Second, if the data elements in a line are to be spaced according: to 
LPFILE*s specifications, then the fi'f^wt two lines of the sample collec-' 
tion form would be, entered ns: 

J 

, 002010501731174 
00201051172000Q ' 

Note that (a) the elements follow each other without any separatloi^ 
character between them and (b) leading zeroes afe used to fill unused 
spaces. ; (In most systems blanks may used in place of the leading 



Ueroes.) The completed file based on the sample collection form may W 
Eound In Figure 7-3 (B). ^ 

The transfer of data from a collection form into the computer may Ke 
done directly or there may be an intetmjfediate etctp which transfers data 
first tp a coding 6hfeet (where the data are arranged in computer-acceptable 
form) then to the computer. * ' 



In any ca^, entry of the file into the i computer from the keyboard of a \ 
Computer termiiial follows standard steps which include:, * 

1. Givinf >the data file Its own name; 

. 2. : Entering the data file one line at a time; 
<■ . ' •« - . . ' ^ ■ * ' 

^^3. Hitting the "carriage return" key atj^the end of each line; And 

. • ■ / ■ '^"^ 

4. Storing the file in the computer's memory area. 

Once entered and stored, a data file can be recalled at any time, for use 
by LPFILE. . ' * ^ 

The contents of these dat;a files may cons:|.st of one occupation pet file 
or multiple occupations per file. A single file may .also contain moire 
than one grade at hire level. During a run of LPFILE, the user may ' 
select^hat portions of the file are to be analyzed. For example, if 
a file contains records on three occupations only one of which is to be , 
analyzed during a given* run, then LPFILE will use only those records of 
employees in that occupation. / 

Each file will have a filfe ending date which -will be entered frbffl the 
terminal keyboard during a run of LPFILE. ^ 

Examples : As a simple example of the techniques us^d by Lf FILE,* we will 
use the sample file of 10 employees found in Figure 7-3 (A).' Of course, 
it is unlikely that auch a small fi*fe would exhibit log-normal character-- 
istics and an actual f il^ vould' cousiBt of many more employees. But a 
small file is useful ^ inuRtratrlon purposes. 

Let's ;iBBw^<» fhat: rh^ f M ^r»H^1>^^ f^3^o for rb-f« sampl^flle Is June 
197S. 

First we have to calculate, for ea* ^ emrloyee in the file thfe values for 

actual and pos-ible t'me on ^. ' pv^mpl^, rov9i6f^r the data for 

employe*^ //I Flcrnro 7 

Date of hire ^ 1/7"^ ^ 
Date of separat- • 
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/Tbis eint>loyee cpuldj have seized froin Jfiinuary 1973 to June 1^975 .(tlfie file ^J*^ 
ending date). , In. other wo^rds, the employee could have 8er\red ffer 2 -y^;^- '^ 
years aniJ 5 mb]ftth8 ,br 2.42^^^^^^^^ But the employee actually s^ed « '^'^'^^^ 

mtll lIpvM>er: ld74| br l,y*iar •attd 10 ikonths dr 1*83 years. \ \ V^l^f 

;'• ^ .];-'^' , ■/.■/■^V','- . ''-^ / V 

Each esq) loyee's tlite valjlies are calctxlated ija the same matiner. And fo't ^^^^^^^^^^^^ 
t^hbse employees wh6\ have! not separated, the two values are equal. For > . ^, 
^. €bcampl€i, for employee #2'^ the actual and possible values are both 2.^8 t i; ' 

years (from Ndvembeir 1972 to June 1975) . ' 

TO contplete'd table of actual/ and possible tiile vali^s is found in . 
\^Figure 7-3(e). Arran^ng tljiese values from lowest ' to highest possible 
. service, gives: ' ^ - 

■ • "v .". •• '■• .'' ' ^ ^' ' ''V ■• " . • • ' ■ • 

.i 1 ' ; ^ Could Have ' Actually I ^ 



■■i 



V, 



Served 




Served ^ 


(Years) , 




(Years) 


1.08 




1.08 


1.58 




» 0.58 


1.75 




l.W 


' 1.92 




1.92 


2.42 




1.83 


'\ 2.58 ' 




2.58, 


• 2.92 




2.50 


3.17 




3.17 


' 3.42 




3.08 


3.-75 ' 




3.50 



The next/step is to translate these values into log-probebility ^, y 
retention pairs. The "Could Have Served" column becdmes the x-values. 
These represent "Tears stnce Hire" values. To determine the. percent 
retained for each x-yal^e two coiints are made: 



' 3^^^ of employees in the *group who could have served to 

/■''-^■S--^;:,,;.^itiine''i5 anJi ■ . 

2. The ^TUlmlj^er of employees , in the group who did^serve to at least 
time x. 

For example, the first x-value is ^-08 years. All ten employees, in the 
^oup could have eteryed for this time but only nine of 'them actually did 
(employee^ #10 left after 0.58 yeari> of service). This gives a retention 
ratio of 9/10 or 90%, ' * 
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The next, x-value is 1» 58 years. Nine of the employees in the group ^ 
could have iserved for this time (employee #9 Is no longer in the run- ' ^ 
nij&g). Of these nine, eight actually did serve for at least' 1.58 years. ' 
Thus the r^tfantion ratio- for 1.58 years is 8/9 or 89%. .And so on through 
each x-v^lue. ^ 



The retention values for this sample file would be: 



Years ,Slnce 




Percent 


Hire •' . •' 


Retention ^ 


- Retained 


(x) 1 ; 


.Ratio 


(y) 


1.Q8. 


. 9/10 


90 


1.58 


8/9 


89 


1.75 - , 


. 7/8 


88 


, 1.92 


6/7 


86 


2.42 J v 


' - ' 5/6 


^83 


2.58 


• 4/5 


80 


2.92 


3/4 


< ' « 75 


3.17 


2/3 


67 • 


3.42 , : 


1/2 


50- 


, (3.75 .. • 


0/1 


■ ^ 0). 



(Since /ther^ was ^no one left to contribut^e to the group retention trend 
by 3.75 years, this 'tim^ value would not enter into the log-probabi-lity 
equatioti calculations,)' * 

" ^• 
LPF-i^LE utilizes the linear least squares technique on these x-'and y- 

Ig. value a to compute a log-probability equation for the given group. , 

As^ we said before, a file consisting of only ten employees ^ is unlikely 
to exhibit true log-normal behavior. To illustrate what LPFILE data 
will look like in a practical situation, we have constructed two larger 
sa?iple files. The first is a hypothetical file for a prof essiona], 
occupation. For illus,tr^tion purposes, we have used the GS-801 (General 
Engineering) occupation. This file, which we have named LP801,. is ^ 
listed in Figure 7-4. The secon<J sample file represent^ a^ clerical 
occupation. For this file, we b**v<^ uped OS-322 (Clerk-Typist). We have 
named it LP322 (pee Figure 7^^V. 

To show the type of retention pattern which is exhibited *by such files, 
we havd plotted in Figure 7 f the retention pairs derived from. file 
LP322 (the scattered dotj) . As vou can see, there is a diistinct and 
fairly even log-normal retention pattern exhibited by these data. The 
pattern deviarfes somewhat in the tail area of the curve. This is due to 
the smaller numbers of employees who contribute to the trend at th^ 
higher length-of- service levels. 



The log-probability curve which LPFILE ^tted to the I1P322 data is shown 
by the dotted line in Figure 7-6. 

\ 
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In Flgure\7-7, the LPFILE, fitted curves for both LP322 and LP80r are 
^plotted, TMs grapfi shows that the existence of occupational differences 
as discussed for longltudinal^data ar« still In effect under the LPFILE 
system. In^^ddition^^^^lti^ ,the LPS^Ol file contains records for both GS- 
5 and GS-7 hire^^^--ttie two retention curves basjed on these'^if f erent 
fentry-levels are plotted in Figure 7-8. This graph suggests th^t there 
may be curve differences between, different entry-^grades that require the 
usi^ of more than one log-probabillty, equation. There^)w4it-(be more on"^ 
. su^Ji. differences in the next chapter. 

Saa^le Run of LPFILE f \ ^ , ^ 

Figu^iE^ 7-9 shows a sample run of the ptogram LPFILE. There are two • 
sections to this. run: information inputs and analysis outputs. 

The f ttst piece of information to be input is the file ending flate for 
the fMe to be analyzed. This date is enteredsj by typing the value of 
the monith (from 1 to 12), and the last two digits. of the year (e.g. , 75 
for 19^), separated by commas. 



The nex^itto needed by LPFILE is the number of employees in the ^J^oJ® 
file. 0% in other words,, the number of lines in the file. Then Tne 
name of t^e file is 'r^uested. As presently set up in tPFILE, t:hdp name 
can be a^'taost five characters long. The fiftrst character must bejalpha- 
betic although numbers may be used in the other positions (e.g., uPSOl). 
This name is the same one under which the datja file was stored ear\Ler. 

That portion of the. file which is to be analyzed during the current ^run 
is chosen by the useir's response to the next question: 

/ DO YOU WISH BREAKp<^ BY: . ' ' 

(1) OCCUPATION > 

If a file contains records for employees in more ^than one occupation, 
these occupations should be analyzed separately by choosing this option. 
LPFILE will ask for the desired occupation code and select for analysis 
only the records of employees in that occupation. The other occupations 
may be analyzed by re-running LPFILE until all the- occupations have been 
con^leted. 

« , (2) GRADE .... 

If you feel that there may be differences in the turnover curves for 
different entry-grade levels (although the effect of grade at hire on 
retention is touch less significant than the effect of length of serv^ 
this option may be used. If there, is more than one occupation iij a file 
then this option alone will analyze all employees in the selected entry 
grade regardless of occupation. 
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' Figure 7-9^ 



ENTER FILE ENOING DATE (MDN:[.H, YEAR) 
I 2. 7h ' 

ente!R~The Dumber of employees -in the file 

60, ' ■ . " ^ 

Er^TER THE NAME OF YOUR TURNOVER DATA FILE 
(MUST BE LESS TH^ OR EQUAL TO 5 CHARACTERS) 
LP322 . . 



DO YOU WISH BREAKDOWN BY: 
(1) OCCUPATION 
• ■ (2) GPADE 

(3) BOTH OR 

ik) NONE. RUN WHOLE FILE 
(ANS I , 2, 3 OR '♦) 

k 



FOR THE ENTIRE FILE: 

THE L-P EQUATION 1$: 
= 0.'»635C + ( -1 . 35693)X 

AND THE STANDARD DE V I AT I DN . I S : 0.0'*116 

< WRITE DUT ACTUAL AND CURVE VALUES! (Y OR N) 
Y_ 

THE RETENTICN VALUES ARE:'\ 



L.C.S. ■ 


-PERCENT 


RETAINED- 


(YEARS) 


-ACTUAL- 


-.CURVE- 


0.25 


0.891 


0.9Q0 


0.33 


0.870 


0.867 


0.'»2 


0.830 


0.336 


0.50 


0.808 


0.808 


0.58 


0. 765 


0. 783 


0.67 


0. 760 


0; 759 


0.75 


0.735 


0.737 


0.83 


0. 708 


0-. 716 


0.92 


'0. 702 


0.697 


■ ■ I .00 


0.696 


0.678 


\.os 


0.689 


0:661 


I . J 7 


0.659 


■ O.SkS 


1.25 


0.651 


0.630 
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Figure 7-9 (cont'd) 



l.'»2 
1 .5C 
1 .5? 
1.75 
1 .83 
1 .92 
2.00 
2.. 08 
2.17 
2.25 
2.33 
2.42 
2.50 
2. 5 P. 
2.67 
2.75 



83 
92 



3.00 



3.08 
3.17 
3.25 
3.33 
3. '42 
3.50 
3.58 
3 . 67 
3.75 
3.83 
3.92 
I4.OO 
I4.O8 
i». 17 
i».25 
**.33 

^4.50 
'4.56 
^4.67 



0^6 f 9. 
0.63'4 
0 . 600 
0.590 
, 0.55> ' 
0.568 
0.528 
0.5*4 3 
0.500 
0.515 

o.'.fig 

0.i*52 
0.^4 33, 

0.^4 1*4* 

0.'429 
0.^407 
0.^4 23 ■ 

.'4'4Q 

.375 
.391 
.36*4 
0 . 3 3'3 
0.350 

0. 3.68 

0.389 
0.^12 
0.375 
0.^400 
357 
,308 
.333 
, 36*4 
0.^4 00. 
0.333 
0.375 
0.^29 
0. 500 
0.^400 
0.250 



0, 

0, 
0, 
.0, 



0 
0 
0 
0 



.0.602 
■ 0.589 
0. 576 * 
0.553 
0.5*42 
-0.532 
0.522 
0.512 
0. 50 3- 

0.'49'4 

0.'4S6 
0 . ^4 7-7 
0.'4"70 
0.^462 

0.'45'4 

0.'''47 

0.'4'40 . 
0.'43'4 

0.i4-27 
0.^421 
0;'4'V5 i 
O.I4O9/ 
0.^403 ' 
0.. 397 
0. 392 
0. 386 
0.381 
0.376 
0.371 
0.3'67 
0.362 
0.357 
,0.353 
0.3*49 

0.3*4*4 

.0.3*40 
0.336 
0.332 
0.328 



AGAI-N WITH SAME FILE? (Y OR N) 

AGAIN WITH /iNOThER FILEV ,( Y OR N) 
Y . 
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(3) BOTH 

Under this optidn, employees in the selected occupation who also were in 
the selected grade at hire are analyzed. 

• (4) NONE. RUN WHOLE FILE ' 

This option should be used for files that contain only one occitp^tion'. 
Or, for fileer^that opntain only occupations which^have been previously 
ana]^zed and found to be compatible (see the next chapter). 

LPFILE theto uses the selected records to c^culate a log-probabl'lity 
equation^ and staudard error, both of which are printed out. The user^ 
h^ Jfhe option to write out the actual retention pairs and 'the corres- 
^i>ondltlg curve values. 

After each run of LPFILE, the user may recycle either (a) to do another 
analysis on the same file or (b) to analyze a different file. 

Additional information about the development and use of" LPFILE can be 
^found in its Technical Analysis and Operation Manual . 
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- CHAPTER 8 
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STAFFING NEEBiS PLANNING COMPUTER PROGRAM; J 



• 1 



LPTEST 



We tiave said that retention analysts is ideally performed by occupation 
using longitudinal data. We saw ' i]n^ the preceding chapter «cases where 
Ij&ngitudinal data techniques are c^ot directly applicable. These cases 
led to the developmyit of LPFILE. * There are also situations where 
either (a) toalysis by individual occupations becomes difficult or (b) 
analysis within individual occupations is desirable. For such situa- 
tions, there is a third staffing needs planning computer program known 
as LPTEST. - . 

When To' Use LPTEST * ' 

" ■ • 

I,PTEJST deals with* the concept of the existence or nonexistence of differ- 
ences between or within occupations. It statistically compares tljeif 
retention trends to determine whether or not they may he grojuped together 
for analysis and planning purposes ('Such as for input to LOGPRO)'. 

There are two types of analysis which can be done using LPTEST. These 
two areas can be characterized as Inter-occupational and intra-occupa- 
tional. 

Inter-occupational analysis : For this type of analysis, LPTEST compares 
the retention curves of different occupations to see if they :have the 
same or similar retention patterns . Those that do can then be aggregated 
into a single planning unit. 

This analysis capability is useful for organizations consisting of 
several occupations with only a few employees in each. Such a situation 
means that there will be only a small number of hires into any one 
occupation during a year. This in turn means, as we saw in the preced- 
ing chapter, that longitudinal analysis for any one occupation would be 
, imprecise. However, it may be that some of these occupations have reten- 
tion patterns which are similar enough to allow for their grouping 
together. Such a grouping can be considered as one occupation for 
longitudinal tetention analysis and projection purposes. 

It may also be the case that, after having separated. out its major occu- 
pations for analysis, an organization might want to combine some of its 
smaller occupations into one or more larger groups which can be analyzed 
using log"»-probability techniques. 
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. As coroilary '^tJ this pf ocess i LPTEST may " be used to test the retention 
patt^Fns of the " same bccupatio^'^at times. That is, it may 

compare, the ciccu|at ion's retention ^ur;?!^ as ca from one year's 

dbhort group with that "^calculated from 'another year/s cohort group. 
Su^ a cotag^Msol will show whether there, has been any change in the 
occupatidn"^8ret^tlon pa«^ , 



jof ^grouped occi^^ations will utilize one lls^^^^j^robability 



efient the entire g/oup. 



An^^lysis for a 
equation ta repi 

\. I . ■ ' ^ 

Intra-occupatiojial acialysi^ ; Far this type of analysis, LPTEST compares 
the retention curves of subgroups of the, same occupation to see if they 
have dif f ereii^ tet^ntibn piat terns. 

A subgfauf of dn occupation consists of employees from that occupations 
who fit into ariy dj^siretf category. Some of the categories may be sex,^ 
minority *statu^, ^ade fait hire, training received, etc. For example, 
LPTE§T may be J^sed within an occupation to compare the retention patterns 
/Of male vs.^.fepiale emplayees, minorities vs. non--minorities, GS-5 vs^.. 
GS-7 Hires, those given special, trainiiig v£ untrained controls, etc. 
a difference lis discovered, the affectjed subgroups can be plaiined for 
separately (sjpatate log-probability liquations Wjill be availaibie). 

The technology used to make such comparisons is also useful in other 
areas of personnel management. There will be more about this in the 
• next chapter 



If 
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There is an Ifmportant point '3bout such comparisons which should be ^ 
emphasized hJre. And that islthat they should be made foi; subgroups 
within the sime occupation . It is not valid to say,^ for example, that , 
there i6 a d:.fference in the turnover rates pf men and women if the 
rates compare td^ come from different occupations. (Remember that different 
occupations lave different turnover raises •) Such rates should be compared 
within occupations where both males and .females are strongly represented. 
This will reaove the differences in turnover rates ^ which are solely the 
result of the differences in occupations. , 



Statistical 



Differences 



I.PTEST coifipSfes two or more occupational retention curves to determine 
if all or some of the occupations may be grouped -together . When looking^ 
at retentio^i trends, LPTEST is searching for statistica lly significant 
differences^ The underlined phrase leads to two questions: 



1, What constitutes a difference? 

^ . ) ' I. • ' 

2. What is meant by "statistically significant"? 
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'Question 1 > There are two oonditions which alqine or In combination can ^ 
cause occupations to have dl'fferlng retentl^ tirends* They are (a) 
differences in the first-year turnover rate and/or (b) differences In ^ 
the log-probablllty annual loss rate* 

Figure 8-1 shows the retention curves for a professlpnal occupation 
(Curve I) and a clerical' occupation (Curve IT). It ^enis obvious just 
by looking at these two curves that they are different I For one thing, 
their- first year loss rates are decidedly different (4^ for Curve I vs « 
42Z for, Curve II). -Secondly, Curve II loses 383: (- 58^" 20r) of Its 
cohort frgm year 1 to year 7, while Curve I loses- oniy<2oZ (■ 83Z - 
55%). This means that Curve II has a faster log-prob^blllty loss rate 
per year than does Curve I. ' . • » 

Another way^ of examining the difference between two curves Is by plotting 
them on a log-probablllty nomograph. This Is done for the two sample ^ 
curves In Figure 8-2. As ycJtl can see. Line I starts and remains above 
Line II. Also, the distance between the two lines Is growing larger as 
time passes.^ There Is a 25% difference between the first-year retention 
rates of the twoJ-ines, This hullds to a 36% difference at the seventh ^ 
year. In other wrds, the two lines are diverging (getting farther and 
farther apart). 

These kinds of differences can also be seen by looking at the log- 
probablllty equations for the occupations tested. For example, the log- 
probablllty equations for the two curves in Figure 8-1 (using the first 
four years of data) are: 

Curve I: y - 1.15682 - l-l/278x 
Curve II: Y ^ 0.45692 - 1.49392x 

First, you can see that the A-terms of the eq\iat ions are quite different. 
This term by Itself represents the normal curve' standard deviation value 
for the /time t = 1 (since x = Log t and Log 1 -0). The first A-term, 
1.15682/ transforms to 88% while the second, 0.456^2, translates to 68%. 
Thus, Curve I has a higher flrst-^year starting point than does Curve II. 
(This difference Is reflected on the nomograph in Figure 8-2. Both of 
these lines were plotted using a 0.5 averaging factor so that the values 
on the flrtet vertical line are for t ^ 1.5.) 

The second, or B, term of th^ equation represents the slope (or steep- 
ness) of the log-probability line. Looking again at the lines on the 
nomograph, you can see that Line II is steeper than Line I. This differ- 
ence is reflected In the B-terms of the log-probability equations for 
these two lines, since the absolute value of the B-term fqr Curve II Is 
larger than that for Curve I. 
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Figure 8-1 
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Years 
Since 
Hire 

i:5 

2.5 
3.5 
A. 5 
5.5 
6.5 
7.5 



Percent 
Retained 

62.9 
75.5 
69.8 
65.2 
61.4 
58.1 
55.3 



Equation: 

Y-ia5682 - 1.17278 X 



Years 
Since 
Hire 

1.5 
2.5 
3.5 

5^5 
6.5 
7.^5 



Percent 
Retained 

57.7 
44.5 
36.1 
30.2 
25.8 
25.4 
19.8 



Equation: 
Y-0. 45692 



1.49392 X 
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Such l^Bp6ctian techndlaues are useful^ in determining occupation^: 
diffiereiniieis whett, occupation*^ bbvlously different retention 

Ikiweyer,, jin^a^ will need tP be compared 4d.ll be 

■eli&ser^ occupations. The statistical 

niqiie uk^fl-iy be liiBC&il in these cases. 

-3Cia tiPTEST nO^y^^ be uaed^^ t^ more than two oqictitpatioric 

which woiuld be .difficult tp accomplish visually. '■'■l-''' ^* T . 



jtence <>f^;: fii' dlf 




ce , howeyer, is pot 
7 not be grouped to^y 



t whether the diff 



Question .2 . Just the 

evidencetfi/to say that^^^Clfe tested 6c6upatJ 

• tt' Is^iecepsat^y to /^p pne step further 

* *i8 Btatistlcially d^igxiif^tofe/ 

/First it i^buld'^eija.a^ when we compare bccupatiofas usititg 

TO are teeftlng to see if /the ii^dividual ^sets of retentl-gii dai|a:i|^ ^^ -i*. 

^samples ^wMich cQtae from the same overall popijlation^i H^'^IEHey <id,^^ttt^^^^^^ 
tj)py' may^^-gt^ ^ ^ ' - 




j No two sets of dattfJ^re ever going to. be exactly alike. Thejt;e?will 
always be some difference yjiich is due to the action\of chande fact'ors. 
What is needed^ is a criterion, to deTOi;plne when the occurrence of a 
dlf f^jence means that the compared sets come from different populations. 
Thiia- criterion is known as statistical significance. Statisticians have 
defined differences as being statistically significant when the ptoba- 
biliiy that their occurrence could be attributed to chance is 1 in 2p 
(or ^ess) . , 5 

To determine what probability value is associmed yith a given, difference, 
the difference must be quantified. This is done by first setting up an 
hypothesis which assumes that no difference exists (the "null" hypothesis) 
and then "trying to disprove it using an appropriate statistical test. * . 
This test will result in a number which will be Associated with a pre*^ 
determined unique probability value. L ^ 

By definition, if this unique pre-^determined probability value is less 
than 0.05 '(» 1^20), then a statistically significant difference will 
exist. If the probability value is less than 0.01, then the difference 
is considered to be highly significant. * These probability values arey 
also icnown as sifenificance levels . ^ 



Statisticians hayi^^rfced out formulas and developed tables which enable 
one to determine what probability value is associated with the result 
achieved by using a giv® statistical tefet. These tables can be foun< 
In any statistics boo 
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The/ Stl^tlstlcal? Test , The .^election of a'?8tatlst:^,cal test shoul'd be 
carefully madie. There are fiany such 'tests and ekth one should be used 
only Itr the sltuatlonCs) /£or whlch/it was, deBlgpfed. • ^ ^ „ ^ !. 

Fpr Its coxiiparlsons, LPTEST uses the "*^-statlst^^ This test , 

compares two- variances (by dl^vliiing .the larger 'vakLsince by tt^.^all^ij} / 
to get value^hlch is knifwri as t'^ . " - If . thi.e^^lUe y 
then the dlffereiice betwei^ .t^e\ two; ^^a'irf^ Is considered tov Be ^Ig-^'^ 

■;n±fi;cStit^ '^%L..\. . ■ ' ■ > 

• ■ ' ' * . . . . • 

The significance level associated with] a given value of F can be Inferred 
-from statistical tables or approximated using mathematical eijuatlons. 
LPxtST contains a subroutine using one such mathematical approximatioi^ 
This subroutine uses a calculated value for F plus what are known as the 
associated degrees of freedom figures for each of the two '^revipusly- 
calculated variances. Qener ally speaking, degrees of ' freedom is* defined 
as the number of data it^s in a sample minus one. In LPTEST, the number 
of data items equals the nuSiber of items from which a variance is calcu- 

>t^^ted. ■ ■ " , 

The LPTEST Method 

LPTE ST makes use of longitudinal data to comftare occupational retention 
trends. These data are in the same format ad those which are entered 
into LOGPRO. Thus, you need to enter, fpr each occupation to be tested, 
the number in the starting group and the number or percent retained at 
later points in time. LPTEST also uses the same statistical transforma- 
tions and basic methodological assumptions that are used by LOGPRO. 

LPTEST can also use data which come from an LPFILE program run. This 
is done by using LPFILE to compute an equation for the occupations 
concerned and replying "YES" to the option to write out the actual ^and 
c\irve values. Then you can select from these lists actual percentage 
values for a few time values (using the same time values for each occupa- 
tion). These can ^. then '^be entered into LPTEST. Since the retention .(or 
' y-) values will already "be percentages, aiiy convenltent values can be 
entered into LPTEST as starting group figures. ' 

As was discussed at the beginning of this chapter, LPTEST type data are 
either grouped by occupation (for inter-occupational. analysis) or by 
subgroup wlthip an occupation (for Intra-occupatlonal analysis). If aliy 
set of groups or subgroups Is^ found to be compatible, they may tfien be\ 
combined for further analysis. Fox example,, combined longitudinal data 
can be entered into LOGPRO for analysis and projection. •> 

Any group of occupations which is found to be incompatible can be ' 
"regrouped"— i.e. , one or more of the occupations can 'be removed from 
the test group and the rest can be run through Vhe program again. This 
process can be repeated as often as the user wishes. 
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To test the. inputted longitudinal data points for any differences, 
LPTEST sets up the null hypothesis that there are po differences among 
the retention trends of the Inputted occupations. Then it calculates ^ 
a value for the F-statistic and determines the probability associated, 
with that value of F. 

To determine a value for the F-statistic, LPTEST uses what is known as 
an analysis of variance technique. This -involves using all of the 
inputted longitudinal data to compute two. specific variance figutes: 

Va = The variance among the inputted groups 

Vw = The variance within the inputted^ groups 

The first vari^ince, VA, represents the differences among the inputted . 
"^groups (or how the groups differ from each other). The secbnd, Vw, 
estimates the variation, that occurs within each group.* The F-statistic 
is calculated using the'^ormula: 0 * . 

^ F = Va ^ ■ • . 

Vw 

If. the null hypothesis is correct, then there should be little or no. 
difference between the two calculated variance figures. In other word^, 
the value of F would be fairly close to 1. Howevet, if a significant 
difference exists, then the variation among the inputted groups will be 
considerably larger than the variations within the "individual groups. If 
this is so, then the value of F wii;L be large • 

The degrees of freedom figure ks bo ci At ^d with Va (Na) is equ&l to the 
number of groups tested minus one. The degrees of freedom figure associ- 
ated with Vw (Nw) is the sum of all the longitudinal data points entered 
minus one. For example, if tli^ee occupations are tested' and each occupa- 
tion has four y^f^rs of tent ion data, then 



The values of F, Na, and Nw are entered into the F-Bpproxlmation sub- 
routine (FTEST) which returns the associated probability value to LPTEST* 
If this value is less than 0.05, then the differences among the Inputted 
groups is statistically significant. 

LPTEST also calculates the log-probability equation : for each inputted 
group, if it is determined that one or all of the inputted groups 
should be analyzed separately, then the log-probability equation(s) 
calculated by LPTEST can be used in any 'further analysis. If, on the 
other hand, the tested groups are^found to be compatible, LPTEST calcu^ 
lates the log-probability equati^ which represents the combined groups. 



S«Bpl<ft Rmi of LPTEST ' ^ . 

liPTBST ±8 designed to accept longitudinal retention data for up to twenty 
occupations or groups. These data are entered during the £irst- or in- 
formation-phase of an LPTEST run. A user may select all or any subset 
/ of tlie^ entered groups for testing. The test results are printed out . 
dutlng the analysis oiitput phase , of LPTEST. 

During the Information input phase of a run, .LPTEST asks for the follow- 
ing pieces of infor^iation: 

(1) The nuinber of groups (e.g., occupations) which are to be 
^ - tested. ' ' ■ 

(2) The occupation or series code for each inputted gtoup. (If 
subgroups of one occupation are being entered, each one will 

/ . need a numeric code.) 



<3) The number of time (or x-) values for which there are retention 
' data. Each group' tested must have the samie number "of 
values. 

(4) The^ value of each x-value. Each group tested must use the 
same x-values. 

<( • ' ■ * ' . - 

^(5) The starting population for each cohort. 

(6) Whether the retention (oi; y-) values are in the form "number 
of employees retained" or "percent of employees retained." 

(7) The actual retention values. 

The data fot item (7) are entered by x-value. That is, all of the y- 
values associated with a given x-value are entered on one line and 
separated by commas. The order and number of the retention values on a 
line ife the same as the order in which the occupation codes were entered. 

In, addition, LPTEST asks for information to determine which of the inputted 
groups are to be tested at this time. First, it. asks if the user wishes 
to test (1) all of the groups or (2) only some of the groups. If all of 
the groups are ta be tested, * LPTEST goes directly to its analysis sequence. 
If only some of the groups are to be tested, LPTEST queries the user as to 
how many groups are to be tested and ^ich ^nes they are. . 

Figures 8-3 through 8-7 show a sample analysis sequence using LPTEST. Five 
different occupations are entered and comparisons are made using three 
different sxibsets o]^. these occupations. The sample data used are: 
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OCCN PERCENT RETAINED AFTER 

CODE N * 1 Yr. ' 2 Yrs. . 3 Yrs. 



\ 



322 30 58.001 44.00% 36.00Z 

312 25 ' 52*73 40.00 32.73 

201 62 83.20 74.50 70.30 

212 50 80.00 70.00 61.00 



1520 15 ,9^2.00 87.00 85.00 

Figure 8-4 shows the order In which percent retained values are entered: 
first, each occupation's "1 Yr." values are entered (in the same order 
as the occupation codes were entered)* then, all of the "2 Yrs." values, 

etc. y 

Since the sample occupations are a diverse mixture, they would seem to^^^ 
fall into at least two obvious groups: clerical (322 and 312) and EAT— . 
(201, 212, and 1520). Using this breakdown^ the first teat made was a 
comparison of 322 and 312. (Note in Figure 8-5 that to make thio^^, selec- 
tion, we entered "1, 2." These are the values that were assigned to 
these groups during the entry of occupation codes.) 

The result of this first test can be found in Figure 8-5. 
see, this output shows which occupations have been tested and stAtep v / 
that they may be grouped together. The log-probability equation f 91: 
each group tested is also printed out. In addition, since these two 
occupations were found to be compatible, the log-probability equation of 
t'heir combined retention values is printed (under the heading Total 
Group Equation"). 

Next, a test was made to compare 201, 212, 'and 1520. The analysis 
output In Figure 8-6 gives the result of this test: these three occu- 
pations may not be grouped. Since It seems likely that occupation 1520 
Is the one which Is gumming up the works, this occupation was removed 
from the group and another test was made. The result of this last test 
is shown in Figure 8-7. The two occupations, 201 and 212, may be grouped 
ao their total group .equatio^ is printed out. - 

. So, from this analysis of five occupations, we came up with three group- 
ings: 322 and 312; 201 and 212; 1520. These groupings may be used in 
further analyses involving these occupations or the occupations may be 
analyzed indlyidually . Either way» the ipg-probabillty equations needed 
are, produced by LPTEST. 

After each run of LPTEST, the user may recycle back to the beginning to 
perform another comparison by either using the same occupations or 
inputting new occupatl^^)^. 

Additional irif ormatlon abotit the development and use of LPTEST can be 
found in its Technical Analysis and Operation Manual . , 

i/pAT is shorthand for Prof dsgi^gdl. Administrative and Technical 
occupations. \ 'vN ' . 
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^ Fi/gUre 8-3 ^ . 

mis PROGRAM 'ANAMtIeS' AND CCrPARES THE RETENTION TRENDS OF 2 OR MORE 
XCUPATICNS TO CETEPMINE WHCTbER THEY CAN 3E GRLUFE^J TCCETHEft TlR, LlG- 
PR06ABILITY ANALYSIS. • . 

FOR PURPOSES OF THIS PRCGRA^1: 

- (1) THE "X. - VALUES" = LENGTH OF SERVICE COMPLETED; AND 
- (2) THE '"Y - VA«S» = NUMBER (OR PERCENT) RETAINED AT TIME X. 

SEE INSTRUCTION MANUAL FOR FURTHER EXPLANATION CF DATA. REQUIRED. 



ENTER THE NO. OF RETENTION G^^OUPS TO BE COMPARED 

- • . 

EN-TER THE OCCUPATION CODE FDR GROUP ^ 

Nd. i: / 

322 , • ' _ / 

NO. 2: 
3J2 ' 

NO. 3 - . . 

201 

NO. h'* * 

7 1 2 

|N0. 5- 

1 52C 



3 



E N T F f ^ y \/ A 1 1 1 f 

NO. \ ' 
1 .5 

No. ?' 

2.S 

ho. 3 = 



1]^ - 



FlRurc 8-4 \ 



ENTER STARTING POPULATICN (fJ) FOR GRDUf^ 

NO. i: ( 322) * 
20 

NO. 2'- ( 312) 
25 



NO. 35 ( 201) 
62 

NO. 4: (212) 

5 0 

NO. 5: (1520) 
15 



ARE Y- VALUES IN 

( 1 ) NUMBER OR 

(2) PERCENT FORM? 
(ANS I OR 2) 

2 



IN DECIMAL FORM, SEPARATED BY GUMMAS 
ENTER THE Y-VALUES CCRRESPGNDING TO X = 

1.500 b: ' 
.58.. 5273. .632, .80, .92 

2.5000: 
r^i*, .^4O..7^45..7 0..8 7_ 

3.5000: 
. 36. . 32 73 . . 703. ^. ' , 
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DO YOU WISH TO TEST (1). ALL CR (2) SOME OF THESE -^onuPS? 
(an: 1 LR 2) 

7^ , ' ■ y ' ■ 

HOW MANY GROUPS DO YOU WISH TO TEST2 

WHICH ONES? ENTER USING THE GROUP NUMBERS V 
ESTABLISHED DUPflNG THE ENTRY OF OCCUPATION CODES. 
(SEPARATE THESE VALUES WITH COMMAS.) 
1.2 ■ 



XXXXX^XXXXXXXX)CXXXXXXXXXXXXX)}XXXXXXXXXXXXXX 

X -.--LPTEST ANALYSIS dUTPUT-^- X 

X • • . / X 

y ,.■./■■ X 

X JLhE 2 OCCUPATIONS TESIEC X 

X 322 fx 

X '312 X 

X MAY EE GROUPED TOGETHER. X 

X 

X .. X 

X INDIVIDUAL StiSGRCUP DATA: .X 

X X 

X --L - P EQUATICN-- X 

X ' CCCN \: (A) (5) X 

X --- V X 

X ^ X 

X 322 30 0.'t6709 -1. 52832 X 

X 317 7 5 0.3 1322 -l.'iCSS? X 

\ . ' X 

X ' X 

X TOTAL GRLUP FQUATICn: X 

X ... X 

X Y - 0. 3°f>5it - 1 , '*70/2X X 

X ^ , ■ X 

X X 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXNXXXXXX^XXXXX 
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DO you" WISH TO TEST ANOTHER SUBSET OF THESE GROUPS (Y OR N)? 

DO YOU WISH TO TEST (1) ALL OR (2). SOME OF THESE GROUPsfl 
(ANS I tR 2) 

i- ' . • 

how MANY GROUPS DO YOU WISH TO TEST? 



WHICH Or^ES? ENTER USING THE GROUP NUMBERS 
ESTABLISHED DURING THE ENTRY OF OCCUPATION CODES. 
(SEPARATE THES£ VALUES WITH CO.^MAS.) 



1 



XXXXXX)CXXXXXXXXXXXXXXXXXXXXX>.XXXXXXXXXXXXX 
X , ^ 



X LPTEST 

X 



ANALYSIS OUTPUT- 



X 

' X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 

'X 
X 
X 
X 
X 
X 



ThE 3 OCCUPATIONS 

201 

212 
1 520 
rAY NOT 6E 



TESTED 



GR'OUPED TOGETHER. 



INDIVIDUAL SiiBGRfiHP CATA: 



OCCN 



201 
2 I 2 
1520 



- p 

(A) 



EQUATION-- 
(6) 



62 
50 
1 5 



15866 
\ U98 

,570^0 



- 1 
-^1 



.18236 
.52008 
.0225'4 



X 
X 
X 
X 

X ' 

X 

X 

X 

X. 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

X 



XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXJXXXXXXX 
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Figure 8-7 



. ■ , r ■ ■ ■ ■ ■ 

00 YOU WISH TO TEST ANOTHER SUBSET CF THESE GROUPS (Y ORN)? 
Y" 



-DO YOU WISH TO TEST (1) ALL GR (2) bo.iE OF THESE GROUPS? 
(ANS 1 OR 2) > 

2 -"^-^ X 



HOW MANY GROUPS DO YOU VvISH TO TEST? 
2 



WHICH ONES? ENTER USING THE GROUP NUMu^..^ 
ESTABLISHED DURING THE ENTRY OF OCCUPATION CODES. 
(SEPARATE 'THESE VALUES WITH COMMAS.) 
3 , ■ 



XXXXXXXXXXXXXXXXX.XXXXXXXXXXXXXXXXXXXXXXXXX 

X X 

X . X 

X LPTEST ANALYSIS OUTPUT X 

X X 

X X 

X THE 2 OCCUPATIONS TESTED X 

X 201 * X 

X 212 X 

X MAY BE GROUPED TOr,FTHFR. ' X 

X X 

X * X 

X INOIVIOUAI ^'jRr.rjrifp f.ATA: ,x" 

X X 

X ^1 - P FOUATICIvi-T X 

X OCCN N > (A) (6) 7 X 

X - - - - V X 

X X 

X 2r01 62 1 ,1 5866 - 1 . 18236 X 

X 2 12 SO 1 , 11 '*98 - 1 , S20'08 X 

X . X 

X . X 

X TOTAL GROUP EQUATION* / X 

X /X 

X Y = 1 . 1 3857 - 1 . 33^1 IX X 

X X 

X X ^ 

yxxx:?"xxxxxxxxxxxxxxxxxxxxxxyxxxxxxxxxxxxxx 
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CHAPTER 9 

■ ■ ^ 

STAFFING NEEDS PLANNING COMPUTER PROGRAM; 
GS810 



!Ehe.Need for Analysis of Advianceinent 

Having covered the detailed methods of analyzing and projecting turn^ 
oyer, thiB' Xoglcal next 'step Is to show how to utilize this - techiciology In 
estimating hiring needs. At this point, however, while the detailed 
dl8Cui38lon of the LPTEST program ±& still fresh, we Vant to. show how an . 
Hidaptatlbn of LPTEST techniques for determining differences can be 
applied to the analysis of grade- advancement patterns. 

Advancement, of course, Is one of the moat fundamental of . all persotmel 
mxyvements. And the goal of advancement based on merit and fltnjess for 
the work of . the service Is one of the most fundamental goals of the ' 
Merit System Itself. Apart from Its Intrinsic interest, however , there 
are^^ number of specific purposes for developing methods of analyzing 
adyaroement patterns. 

One purpose ^Is. for occupational studies. An occupation's advancement 
pattern 1$. one of that occupation's most characteristic and d^I-stinctlve 
features. 3elng able to analyze advancement patterns and to distinguish 
between the iadvancement patterns of different occupations are important 
aspects of occupational analysis work. 



A second ^ifl^rp^se of analyzing occupational advancement patterns Is for 
use In employee career coinisellng activities and — along with data on 
Inte^-'occupatlon mobility trends — ln*the establishment of lines of 
promotloh for. setting up merit promotion plans. 

A third reason is for the purposes of research. Being able to analyze 
and compare advancement patterns for different groujplngs would give us a 
powerful tool,^or example, for comparing the effects on advancement of 
such factors as differences in personnel characteristics., or the effects 
of different kinds of ttraining programs. Advancement of employees who . 
have advanced degrees, for example, versus those who do not. Or advance- 
ment-of employees given special post-entry training versus those who 
were not. And so on. 

A fourth purpose for analyzing and comparing advancement patterns is to 
idkntify individuals and/or groups whose advanc^ementj is significantly 
above or below the norm for their occupational group. Identifying 
employees whose advancement is much above average, for example, can be. 
very useful in programs for identifying potential future executives. 
Identifying employees with below-normal advancement, on the other hand, 
can be useful in remedial training, performance evaluation, and similar 
programs. < 
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Finally V GS8J.0 can be of use in equal employment opportunity programs, 
in comparing a4vancement patterns of minority groups with patterns of 
non-minorities . 

Equal employinent opportxinity, of cAurse^ is .an Important responsibility 
both of government— Federal , Sta^ and local—and of private industry as 
well. Most major employers, in fact, devote substantial and continuing 
efforts toward the establishment and Improvement of their EEO programs. 

Data analysis usually is a key element of all of the above-mentioned 
programs. It prot^ides objective means of assessing program progress. 
It can help to pinpoint program areas which may Ije.in need of improvement 
It can show the effects of past actions. And it *c an indicate the likely 
future effects of cu^rrent actions.- 

We believe. that in LPTEST we have,ym analytical technology which can be 
of significant help in such analysS|||| 

Ba:^riers To Analysis 

There. are truly fortaidable barriers, however, to applying LPTEST tech- 
niques to the analysis of advancement data. Let us cite just a few. 

First, we have established through research which has followed group 
.grade advancement over many years that the mean grade of a given group 
of, say, GS-5 hires, rises over time along a log-normal curve. This is 
just like the L-P (Log-Probability) Surnover curve described earlier 
except that it go^s up instead of down, as in t'-.- following /illustration, 
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The significance of the fact that this is a rising curve lies in the 
neceasity-r-vhlch wa^ explained^ earlier in the transfoxnnation of L-P 
curve data — for converting y-axis data into "Percent of Maximum Possible 
Value of y" and thenc^ into standard deviations, for L-P curves, •'we 
could do this because, we knew the maximum possible value of y: the 
number of employees we started out with. Because advancement curves are 
rising curves, however, we have no way to determine what the mflY-ftniim 
possible value of yj^ight be. Thus, we have no way of converting o\\t 
data into the "standard deviation" form needed for least-squares fitting. 
The curve fotm used in LPTEST, therefore, cannot be used for advancement 
data. ' 

Those with economics backgrounds might at this point suggest that we 
substitute for the L-P curve form one of the standard growth curves used 
in economics, such as the P^arl-Reed logistic or the Gompertz: 

(1) 1/y « a + bc^ (logistic); or 

(2) log y = logA + logb(cX) (Gom^pertz) 
* • • 

These curves are such, howerver, that they can be fitted to data only by 
approximate techniques; , least-squares fitting is impossible. If they 
were used, we could then not usfe the F-test of variance to permit com^ 
parison of one cuirve relative to another. 

The second major barrier to the use of the LPTEST technique is that of 
the availabiJlTty of data. The L-P curve technique would require longi- 
tudinal data on workforce advancement trends for, say, ten to twenty 
years past. Such data are simply not available; Thus, here too, the 
techniques of LPTEST cannot be directly applied. 

Finally, any analysis of advancement trends must have some means of 
scaling jobs by grade level; i.e., into nxmbered intervals, rather than 
the continuous variable y-values used in LPTEST. GS810 uses the General 
Schedule grading system used in Federal white-collar employment. The GS 
system is not used, however. In State or local government, or in private 
industry. To be useable o^'tslde the Federal government, then, some 
other means of qt^^I Idr io>> if^volc y-v«l "ee^-must also be provided for. 

GS810 Features . 

Fprmidable as these three barriers are, they are not insurmountable. To 
overcome them, four major adaptations of LPTEST techniques haVe been 
developed. These give our advancement program GS810 four characteristic 
features which distinguish It rather sharply from LPTEST arid which 
should be thoroughly understood. These are as follows. 

9 

First, GS810 uses census-type data, rather than ^ longitudinal data, in 
its calculations. That is, all of the data used in GS810 are collected 
at one point in time, rather than over several successive time joints. 
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Wc can illustrajte the difference this way. Suppose you wanted to study 
children's grovtii patt^ during their grade school years. If you were 
to use the longitudinal method, you would take a particiaar group of 
children just entering grade schoal, measure^ their heights at that 
poiht, a^d re-measure them annually thereafter until they le^ft grade 
s'Chool;-- ; . , - , "~ ^ , 

To use the: i^ensus method , on the other Iwnd, you yould measure all the 
children itv tihe school at the same tlmc^ Then you would group data from 
all first-graders together, all second-graders^ all third-graders and so 
on, and average the heights in each grade separately. The resultant 
curve formed^ plotting these averages would look much like the curVe., 
that would be gotten from longitudinal data, except that each year's 
observations would be of different children. ^ ^ .. 

The setond major difference between LPTEST and GS810 is that GS810 uses 
a different curve form for fitting to the data. The form used , is a 
variety of exponential: ^ 

y = a x'^ 

The reasons that this form is used can be summarized quite briefly. 
First, by means of logarithms, it cto be converted into the simple 
linear form: 

log y = log a + b log x— ^ 

This form can be fitted to data by straightforward least-squares techniques, 
and the F-test; procedures similar to those used in. LPTEST can be used ±n 
GS810 with full validity. 

Second, when the total time span covered by the data is Itield to not more 
than about 10 years—as compared to the 30-40 years that might be theore- 
tically possible ixL an old-time organization— this curve form fits grade 
advancement data so closely that it is statistically virtually indistin- 
guishable from what could be gotten using 'the log-normal curve form. 

The third major difference between LPTEST and GSSIO is that in GS810 the 
y-values data are in grouped-data form, rather than in continuous variable 
form. In L-P curve fitting, you will recall, the value of y— the frapction 
of the starting group still present — ^was a continuous variable which 
could take any value between 0 and 1.0, (0% to 100%). Thus, we could— 
and did— get values like 0.1275 (12.75%), 0.6733 (67.33%), and so on. 



1/ This form of linear equation is referred to as "log-log" since both 
^ the X- and y-values are transformed to th^it c^responding logarithms.^ 
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Grade level figures » however » are not continuous variables* There are 
no fractional gradei?. There are only a limited number of grades into - 
which the' full, spectrtim of job difficulty must be grouped* , Such groupr 
ings obviously ignore fine differences in job difficulty. Thus» a .job 
that bajrely reaches the GS-r3 level of difficulty is "GS-3," not "GS- 
3;01." By the same token, another Job., which falls just a hair short of 
reaching the GS-4 level, is also "GS-3/' not "GS-3.99," v 

Each grade-level number, in other words, stands for an Interval , "GS-3" 
stands for "GS-3.O0 to GS-3.99." "GS-4" stands for "GS-4;00 to GS- 
4.99.^* And so on. Grades, then, are jusl^like the ranges that are used 
when grouping continuous variable data t^ether according to intervals* 

To return to our grade-school example, for a momentV suppose we took all 
second-graders whose fieight we measured and we made up a grouped data 
table showing how many there are in the 46-47, 99"%interval (3 feet 10 
inches, but not 4 feet), how many are in the 48-49v99" intervaly and so 
on;* To make up a "grouped-data average," of course^ we would take the. 
ntimber of children in each interval, multiply this times the midpoint of 
each interval (47.00^, 49.P0'\ etc.), -and then divide the overall sum of 
these products by >tTie total number of ehULdren to get our desired 
average. \ ' ■ 

GS810 handles grades the same way. Since "GS-3" includes all jobs whose* 
difficulty falls in the range "3.00 - 3.99,^ the pidpoint of the "GS-3V 
interval is weighted "3.50." Similarly, the "GS-4". range midpoint is 
"4.50." And so on. 

A 

The "length-of -service" dimension is treated in exactly the same fashion. 
The employees who are in their first year of service fall in the "0.0 - 
0.99" interval. The midpoint of this interval is "0.50." TKfe^mldpoint 
pf the "1.00 - 1.99" years of service interval is "1.50."'' And so on. 

In GS810, then, all computations are done from grouped-data tables, with 
both x-axig' (length of service) values and y-axis^Xgirade) values made up 
of intervals. Such an 8 row by 10 column tabular format is an "8 by 10 
matrix," in computer parlance. (See sample below*) This is where the 
"810" part of GSBID' s name comes from. 

The '^*GS" part of the name comes from the fact that GS810 automatically 
provides correct weights for the two most common grade progression 
patterns- of the Federal "General Schedule" <GS) occupations: GS-i/8 and^ 
GS-5/15. (The program also permits entry of other weight patterns, if 
desired, for non-GS or non-Federal occupations.) ^ 

The foutth major difference is that although LPTEST and GS810 both use 
the F-statistic to determine if a' difference exists, the two programs 
use different methods to obtain the variances which are coiapared. 
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In LPTESt, the variances which are tested are those withi^ and between 
cblunrns. In GS810, on the other hand, the data for each individual efub- 
group are added together and a log-log least-squares line is fitted to 
the sumned (or total group). The resulting equation ri^presents the 
overall group norm. In addition, log-log least-squari^ lines are fitted 
to each Individual subgroup dtid three types of varianc^§s are calculated: 

, 1. The variance of the total group data from tije total group 
'equation; * 

' ■ ' •• * ■ . • , ^ • 

2. The variance of each subgroup' a data from jtlie total group 
. * equation; and - » . 

-3. The variance of each subgroup' s data from its own eqtiation. 

Al of these variances are calculated utilizing differences between 
actual and curve values as described in Chapter 3. - 

Three kinds of conparison tests are made using different .coroblixatiohs of 
thes^ variances. A value for the F-statistic is calculated for every- 
test. The te^ts are discussed lat^y. , 

Preparation or. Data * * , 

Jo assemble the data needed for a GS810 analysis, then, we would proceed 
like this. First, we would take all empl<)yees who have ten years or 
less of service an4 for each such employee, we would make up a. card "^ 
shoving:^ 

■ , ' , * 

(1) occupation 

(2) sex . 9/ - • 

(3) minority status ±f . 

(4) length-of-servlce since entry on duty - 

(5) grade 

Then, we would sort these cards into stacks by occupation — one^ stack yer 
occupation. The number of Btarks ve get will determine the number of 
* .GS310 occupational analyses wp can do. . • *, » 

To prepare an ocdupatlon. for. « r,^f^]0 stnalysis, we then sort each occupa-* 
^tipnal stack into four pllef?? ^ 

Minority Male (MM) 

Minority Female (MF) ^ 

Non-minor itY Male (NM) 

Non-minor ity^Female (NF) ^ 



1/For other phases of advancement research, data elements (2) and (3) 
may be replaced or) supplemented by other test characteristics (e.g.,-^ 
receipt of post-entry training, age-at-hire, educational level,' etc.) 
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We then take each pile and sort it by length-of -service into ten groups; 
0-0.99 years, 1.00-1,99 years, etcf, and sort each of these in turn by 
grade* Then, we count the number of cards in each block of our 8 x 10 
table and we rexrord the results in matrix form like this (entering 
zeroes in each block yhere no employees were found) : 



0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 




0 


1 


2 


7 


7 


1 


3 


3 


0 


0 


0 


1 


4 


4 


8 


2 


4 


7 


1 


3 


9 


13 


24 


i3 


21 


6 


i 


3 


31 


42 


51 


40 


48 


48 * 


28 


4 


6 


6 


119 


128 


109 


19 


27 


23 


17 


2 


2 


4 


82 


30 


10 


2 


1 


5 


4 


1 


3 


0 


10 


3 


0 


0 


0 


1 


0 


0 


0 


0 



Following the GS81U tcdmicai diiccLioi*o, we eutci these data into the 
computer as named files. For occupation i^99, for example, we would have 
files named ''NM9^9," '•MF999/' and so on. 

From this point on» the complete duaiyttlts Jot will be done entirely by 
GS810 in the pattern deuired by the program operator. 

Using GS810 

Since the GS810 Up.ii.ci Liou M<tavial pioviUeb a detailed, step-by-step 
description of GS810's features «nd optious, we will not repeat these 
details here. BUther, we would like to discuss the principal things 
that GS810 doeo and how it can help the analyst identify?'' the nature and 
source of the kind of problems that are typically encountered. 

At the outsets it must be clearly understood tl^at GS810 must be used 
only for the kind of comparisons for which it was designed: comparisons 
of the overall norm of the occupation of which the subgroups are a part. 
To put it another way, GS810 cannot be used to compare any subgroup with 
the norm of any group of which the subgroup is not a part. 

The reason for this lb inherent in GS810's methodology. It is an abso- 
lute requirement for GS810's use of the F-test that all of the data for 
the subgroup T^^ing tested must have been used in computing the overall 
norm curve and variance against which the subgroup is tested. Any 
departure from this requirement, however slight it may appear, destroys 
the validity of comparison results. 



With this preliminary caution firmly in mind, JLet us now look at what 
GS810 does in its comparisons and how^ it can help the analyst pinpoint 
problem areas. ^ 
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To begin vlth> GS810 tests each subgroup in three ways. Once to test 
the subgroup's grade-advancement ttend against the occupational notm. 
Once to compare the width of the subgroup's skill spread above and below 
th^ advancement trend line with the skill spread of the occupation as a 
^ole. And once toLcompare the total grade-time pattern of the subgroup 
with the total grade-time pattern of the overall occupation. ^ _ 

More specifically, the first test is an evaltxation of the value of the 
ratio: V 




variance of subgroup from overall group advance- 
ment curve 

variance of subgroup from curve computed from 
subgroup data only . , 

This test is designed to answer the specific question, "Is tlxe grade 
advancement curve of this workforce subgroup significantly different 
frop the bverall group's no.rm?" If the answer to this question is 
"Yes," the program then determin^Q. whether the subgroup curve is: 

"Higher" t-^ Both A and B terms of the subgroup curve equation are 
above the corresponding group norm equation values; 

"Lower" — Both A and B terms are below the group norm values; or 

^ "Different" — One value is above, one below. ^ ' 

When a "significant difference^' is found on this test, the analyst 
should carefully check the equation values In the "Results** table and/or 
plot out on graph paper both thet subgroup durve and the group norm 
curve. If the "A" term values are substantially equial, and the "B" term 
.values are conspicuously uneqixal, then a^plot of curve values will show 
the curves starting at -or near a common Jpolnt and then diverging progres 
^vely over time. This is the classic pattern of a group which is 
receivlng^clearly differential treatment as coiapared to the norm. 

if, however, the "A" term difference is as great or greater than the "B" 
term difference, the case is an ambiguous one. It could be the result 
of differences in. the gradefe;iistribution of new hires. Or, it could be 
the result of marked changes in hiring patterns during the t^n-year 
period of tl^ sample. Or, it could be the result of a number of other 
factors. When tlie two respective curves, then, converge or Intersect, 
sathef\than diverge from a common starting point, further Investigation 



where 



- . ^SG 



a2 

SS 
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* ±8 needed to determine the causative factors. No clear-cut, direct 
f Iniilng is possible. ^ 

The second test In the GS810 technic^ sequence Is an evaluation of the 
ratio: 




(whichever Is greater) , where 

° variance of overall group from group equation 

2 

^SS variance of subgroup from subgroup equation 

In this test, we remove advancement from both subgroups and overall 
group data to test the question, "Is the spread of subgroup employee 
grade (skill) levels above and below the grade advancement curve signi- 
ficantly different from the overall group norm?" If the answer to this 
question is "Yes," the program then determines whether the subgroup 
spread is "greater" or "smaller" than- that of the overall group. 

Generally speaking, a finding that the subgroup grade spread is "smaller" 
is more common than the finding of "greater," particularly when the sub- 
group involved represents a minority. In most cases, the /"smaller" 
spread means that subgroup employees are closely clustered around the 
^rade advancement curve with few or no employees being much above or 
below it, even in the first few years of service. Generally, this 
indicates that relatively few applicants who are qualified for above- 
basic-entry grades are available for hire in the labor market. Thus, a 
"greater proportion of minority hiring is at the basic entry grades than 
is true of non-minority groups. 

Wher^ tjie finding is "smaller,-* then, additional study of the proportion 
of subgroup above-entry eligibles in the labor market is needed to esta- 
blish the nattxre of the problem. 

A finding that the subgroup^ grade spread is "greater," on the other 
hand, almost always stems from the subgroup's getting a greater than 
normal proportion of its entrants by in-service accessions — e.g., the 
promotion of long-service clerical eiaployees into professional Entrance 
grades. This can be recognized in the subgroup's grad^-tlme matrix by a 
greater than expected proportion of eiaployees showing up in the eyitry 
grade(s) in the later years of service (6th, 7th, etc.). 

A finding of "greater" spread, that is, is a common result of effective 
upward mobility programs. This effect can be readily cross-checked by 
redefining the grade-time Aatrix substituting "length of. service in this 
ocCupa1:ion" for "length of service since hire." If the, "greater" iElnding 
is the result of in-service hiring factors, this* will cause the difference 
to disappear. 
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The third and final test of GS810 is a test of the overall subgroup 
matrix against the overall group matrix. This involves" the valuation of 
the ratio: 




where " " " 

" <^^G = variance of the subgroup from the pverair group 

equation 

« variance of the overall group from the overall group 
equation ' . 

<, 

This test is designed to answer'the overall question, "Does the overall ; 
subgroup grade/time pattern which results from the combined effects of 
(a) the subgroup grade-advancement"^ curve and (b) the subgroup's grade 
spread, significantly differ from the overall pattern of the occupation 
as a whole?" 

The findings possible on this question are either "different" or "not 
dlff^ent," In almost every case of a "different" finding, a "signifi- 
cant difference" finding will have shown up on one or both of the first 
two tests. Essentially, then, the third test measures the overall 
effect on the' subgroup grade distribution pattern of the curve and 
Variance factors which were tested separately in the first two tests*. 

Putting it another way, if the third test results in a "significant 
difference" finding, then the subgroup can be expected to show differ- 
ences by other tests as well — average grade of subgroup vs average grade 
of occupation, percentage of subgroup population reaching upper grade 
levels as compared to overall occupation population, and so on* 

Further, if the subgroup distribution shows up as significantly different 
for employees with up to ten years of service, it is highly likely that 
these differences will continue to be present for the foreseeable future. 

What a "significant difference" finding on this third test does mot 
automatically mean, however, — and this must be emphasized in the strong- 
est terms-r-is that these differences must have resulted solely and 
entirely from discriminatory treatment. On the contrary, as we saw 
earlier, differences can result from effective upward mobility programs, 
unusually high levels of occupational advancement, difficult labor 
market conditions, and so on. Differences can result from discrimina- 
tory treatment, of course. But the finding of a "significant difference" 
on the third test by no means equals a finding of discrimination. 



Another caution is also In order ^ this one more general. If the 
population under study includes employees/ who were hired long before 
EEO. programs began to have a real impact /on recruitment and hiring 
vpractlceSy past discrimination could be misinten>reted as an indicator 
of present discrimination in the occupational series studied » thus 
overestimating the extent of present discrimination. One way to avoid 
this is to provide for separate analydl:^ of persons hired more recently — 
say, since 1969 (E. 0. 11478) or 1972 (|he EEO Act). 

Conclusion 

Let us conclude this chapter by pointing out once again that GS810 is a 
powerful analytical tool for personnel specialists and other interested; 
officials to use in evaluating and comparing occupational and subgroup 
advancement rates. It will do many things for you and do them accurately 
and objectively. V But it does not and qannot substitute for the reasoned 
judgment of the analyst. .It can be an invaluable guide for pinpoiiiting 
areas of inquiry. But it is the analyst who must carry dut those "inquir- 
ies to their logical conclusion. 

In the next chapter, we will return to the step we alluded to at the 
start of this chapter: applying the log-probability techniques set 
forth previously to the key staffing needs planning problem of esti- 
mating hiring heeds. 



3^/Since a single run of GS810 involves the possible operator selection 
of several different options, each of which will result in a different 
run format, no one single sample run of GS810 is reproduced here. A 
reader can look at Appendix ( Sample Outputs) to study the outputs . 
produced by GS810. 
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CHAPTER 10 



$yAFFING NEEDS PLANNING COMPUTER PROGRAM: 



HIREST 



First-Year Projections 

The-€^^1 goal of the technology that has been described earlier Is to 
penolt pi&rsonnel m^agement officials to t^ake effective estimates of 
. future staff Ing needs in^ the oT:ganlzatlon(s) they serve. Such $.yture 
needs estimates are essential If personnel (officials are to rationally 
plan future personnel management programs to meet those needs. 

Xn some circumstances, of course, future needs can be estimated by 
simple extrapolations of past trends. 'As we showed earlier, when (a) 
employment In the future Is expected to follow the same trend that It 
has In the recent past, when (b) the same percentage of accessions are 
expected to be new outside hires, and when (c) no change Is expected In 
the percentage ^f the workforce who will be coming to retirement ellgl*^ 
Ijlllty, then — and onl'S^ then—curretit turnover and retirement tates can 
bf expected to continue In the future without significant change. In 
these cases, simply extend current loss rates\into the future unchanged. 

If, howeVlsr, some^ signif icant^d^parture from past trend is expected in 
one or more of these three conditions — ^employment, percent of new out- 
'^ide hires, or retirement eligibles — then either the tumoVfer rate or 
'-the retirement rate, or both, may be expected to change sigtilf icantly. 
Where this is so; estimates need to be made .of the direction and miagni-^ ' 
tude of such change (s). 

With our final computer program, HIREST, we now show how the L-P tech- 
niques described above may be combined id. th standard actuarial technique^ 
to yield such estimates. . 

To show how this is done, review Figure 10-1. Note first thati the 
"Growth*' portion of hiring needs is determined ^by the dif feremre between 
the'totial population in year X + 1 (next ye^r) and the total population 
In year X (this year). This diffe^rence can be determined ^iT^imple 
subtraction of the starting figure fr5^ the target figure./ 

t ; . ■ ■ ■■ T . 

Note also, however, that by far. the bigger share of the^ir ing needs 
total is accounted for by "Lo^ses>" This is where ojjar comb^inationVof 
techniques is required. ^ , > . ^ ^ J^v 

A relatively small |>ropoi:ti9n of total doVses^in an average org^nlzat lorn 
consists of losses for actu|trial-type reaspns: death, disability, and 
retirement. Figure 10-2 contains ^k; sample of actuarial tables for these 
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0.0005 
0.0006 
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0.0016- 
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0.0022 
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0.0054 
0.0061 
0.0066 
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0.0085 
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FEMALE PROBABILITIES 
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three types of losses^ Note that for both death and disability the « ^ ^ 
employee data needed are simply "Sex" and "Age." For retirement, how- 
evei', "Length of service toward retirement" is also needed. 

Not* also that what the tables provide is the probability of the employ- 
ee' s being lost to that cause during a year. Th^e are, in other words, 
annual loss probabilities. 

We can make up an estimate of the numbers of deaths, disabilities, and 
retiremefits4p be expected in our workforce in a year, then, by deter- 
mining each employee ' s probability of loss under each category and then 
adding up all of the death probabilities, all of the disability proba- 
bilities,, and all of the retirement probabilities. X^e sum of each 
category will *e our estimate of expected losses in that category. 

If, for example, we have 100 .employees whose retirement probabilities 
average 0.05, we would expect 5 retirements. If the sum of disability ^ 
probabilities was 0.95, we would expect one disability. If death proba- 
bilities totaled 1.03, we would expect one death. 

As you see, if we have each employee's date of birth (DOB), sex, and 
service computation date (SCD) , the finding of probabilities and adding 
them up to make one-year estimates is simple and straightforward. 

To these actuarial estimates, we must ne^tt add estimates .of the proba- 
bility of of her types of loss from a group during the year. To do this, 
we will use the L-^P equation computed for this group. 

Let us assume thac che L-P equation for our group—l.e. , occupation—is 

y = 1. 1 - 0.9 log X - * 

Then let ua assume that we have an employee whose entry on duty (EOD) 
date is four and one half years prior to (the start of our projection 
year. We first use the equation to get the employee's probability of 
survival to the start of the projection year by substituting 



4.5 



into the equation to get 



. y = 1.1 - 0.9 log (4.5y 
y = 1.1 - 0.9 (.6532) 
y = 1.1 - .5879 = 0.5121 
p(y) = .6957' 

Then to get that employee's probability of lasting to the end of the 
period, we- add one year to the 4.5 and substitute 

X = 5.5 



- 1^0 - 



into Che equation, getting 



y - 1.1 - 0.9 log (5.5) 
y - 1.1 - 0.9 (.7A0A) 
y -.1.1 - .6664 - .4336 ' 
p(y) - .6677 

To get <Che probability of the eorployee's lasting from the start of the 
year to the end, we divide the ending probability by the starting proba- 
bility; 

^ 6.6677/0.6957-0.9598 

Thu6» the probability of this employee's being an L-P type loss during 
the projection year is: 

1 - 0.9598 - 0.0402 ' 

» ■ ■ "' 

By using this method , we can determine the probability of L-F type loss 
for each entployee, using that employee's EOD date. These probabilities 
can then be added together jus^^irike the actuarial loss prottabllities to 
make up an estimate of group L-P type flosses for the projection year. 

'* 

No>7» since ve can estimate, for each employee, the probability of loss 
due to death, disability, and retirement reasons by means of actuarial 
tables, and the probability of loss due to other causes by.^eans of the 
L-F technique, it is obvious that if we add all of these pt:p})abilities 
together the sum total must equal the'^total probability pf that employ- 
ee's being lost for all causes during the year. And since we can make 
such estimates for all employees in the workforce as of year X, and 
since we can add dur individual estimates together to make gtroup esti- 
mates, we obviously^ can make estimates for total group losses during the 
year from date X ^to X -f^l . 

Further Years^ Projections 

From this point on, however, the mathematics gets a little more com- 
plicated because it is not enough to/ make projections only to point X + 
1. We must make them for aii additional dne (or more) years beyond point 
X + 1. And to make projections for a second period Involves some mathe-^ 
matical problems. ' 

Assume for % mom^t that you are part way along in a fiscal year. You 
have just gotten a file of employee^ data for the workforce on board as 
of date X» the beginning of the fiscal year which you are now in. If 
you make a projection of that; data through date X plus 1 year,. you hav^ 
madiea projection only through the end of the current fiscal year. / 
ObvlSaBly, theni-|if your desire is to make a projection through the / 
Budget Year, or^ beyond, you will have to make further years* project ipns 
in order to get what you need. 
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The matheinatical problem in making such further years' projections does 
not stem from our basic probability-estimating techniques— these yill 
work perfectly well for any year you name. Rather, the problem stems 
from the fact that estimateis for periods after the first are estimates 
,of conditipnal probability. . 

We c^ expladja^the problem this way. Suppose you bet a friend that if 
you flip a coin it will come up heads. What is your probability of 
winning your bet? Simple enough, 0,50, Suppose you then bet again that 
you can flip the coin and have it come up heads, what are your chances 
of winning a second time? Again, 0,50, Kor each independent trial, 
that is, the probability of heads is the same. And since each bet was 
for only one tribal, ypur probability of winning your bet was 0,50 both 
times. 

Now, however, consider the conditional-probability case vhere what you 
are betting is that you can flip two heads in a row. Now the question 
becomes, "What is the probability that you will lose (flip a tails) on 
the first flip and, What is the probability that you will lose (flip a 
tails) on the second flip?" 

The probability of flipping a tails on the first flip is still 0,50, as 
in the one-trial case, And^ in tho^e cases where you have gotten a heads 
on the first flip, the chance of getting a heads on the second flip is 
again 0,50. But remeniber, you had a 0,50 probabilitfy of getting a tails 
on the first flip and thus never getting a chance to take a second flip 
at all, ^So the probability of your losing on the second flip is 0,50 — 
the probability of getting heads on your second flip — times 0,50, the 
probability of the first event (getting a heads on the first flip) , 
which must have occurred 1>efore the second event (second flip) can take 
place. 

Thus your probability of Ibsing on the first flip is 0,50 and your' 
probability of losing on the second flip is 0,25, Yoxir probability of 
, flipping two hi^ads in a row is therefore what is left, or 0.25, 

This multiplying of probabilities together when the probability of a 
second' event is conditional upon the occurrence of a first event also 
must be done when you are making projections for more than one time 
period. 

Consider, for example, the case c?f^an employee who we determine has an 
L-P loss probability of 0,10, a «fe5ath probability 'of 0.01, a disability 
probability of 0,01, and a retirement probability of 0.02. 54n<ie the 
employee is present in the workforce at Jijie start of the year, the. 
probability of being present is 1.00, or 10(HK. certainty. We can set up 
our first-year projection table like this: ^. 
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Loss Prob, of being Net Prob. 

Prob, present » sjtart of year > of lose 

L-^P loss 0.10 X' 1.00 ' » 

Death 0.01 x 1.00 

Disability 0.01 x 1.00 « 

^ Retirement ^0.02 x 1,00 
Total 

Probability of retention to end of year: 

1.00 - 0.14 = 0.86 

For^our second projection period, Ihovever, ve must multiply the rates 

estimated for each employee who was present at the start of a year by 
this employee's probability of being present at the start of the second 
year, like this: 

Loss Prob. of being Net Pirob. 

Prob. present/ start of year of loss • 

/ 

L-P 0.0.9 X . _ 0.86 - 0,0774^ 

Death 0.01 x 0,86 . » 0.0086 

Disability 0.01 x ; 0.86 - 0.0086 

Retirement 0.02 x 0.86 « 0,0172 

Total 0.1118 
• ' .. ' •■ I 

Probability of retention to start of next year: ^ 

■** ■ 

0.86 - 0.1118 «= 0.7482 

And so on. Projections for a third or subsequent , year would be made In 
a similar manner » multiplying each annual loss probability by the 
probability of the employee's being present at the start of the yean 

Projecting "Hires Needed" 

If you will go back to Figure 10-1 now, you will see that by these 
methods ve have made a projection of the losses vl^ich can be expected 
during the projection period among those employees whom ve started out 
with in year X, the as-pf date of our employee data file. And you will 
note that the sim of "Losses" plus "Growth" equals "Hires," the number 
of added employees that you need to have on board as ojf date X +*1 in 
order for the workforce to be the required i^^e. 

Please note very carefully, , however, that this^ntanber of added employees 
does not quite equal the total nuimber of new accessions that must be 
made to the workforce in order to have the needed number of added 
employees on board at. the end of the year. Because your hiring is 
spread put over the year, some new hires will have quit before the end 
of the year. So you must actually make more accessions during thiB year 
than the number of added employees you need :at year's end, 
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How do we estlinate how many more accessions we will need? We use two 
methods, based on the two different kinds' of accessions that can be 
ma'dc», new outside hires, or in-seryice accessions of employees from 
other occupations or organizations. 

HI&EST asks for input on the percentage of total accessions that Are new 
hires (i.e., new Career-Conditional appointments). (This proportion, by 
the way, tendB to be remarkably stable in most cases over even widely- ^ 
varying conditions.) HIREST then takes this f igure— jgay, 60%— determines 
the percent filled by in-service accessions — in this case, 4pZ — and theil 
multiplies these percentages times the "Hires Needed" total from Figure 
10^1 to get estimates of the actual numbers of both kinds of accessions 
who will be needed by date X + 1.1./ 

In the first ^estimating method we use, then, we assume that new hires 
are spread evenly over the year. Thus, at the end of the year, the 
newly-hired employees will have an average length of service of 0.5 
year. To find what portion of these will have been lost to turnover, we 
substitute 0.5 Into our L-P equation: ^ 

y - 1.1 - 0.9 log (0.5) 
. f y = 1.1 - 0.9 (-.30103) 

y = 1.1 + 0.27093 = 1.37093 
p(y) = 0.9148 

To find the* number of new hires which must be made during the year per.,..^^^^^ 
eiriployee on board at the end of the year,, we divide the retention rate 
into 1: * ' ^ 

1/0.9148 = 1.093 

By multiplying this figure times the number of newly-hired employees 
needed on board at the end of the year, we get an estimate of the number 
of new hires we need to make during the year. 

Projecting losses from this new-hires group for further years requires 
only elementary L-P technique. If their average LOsZJ at year X + 1 was 
0.5, their average LOS a year later, at year X + 2, is 1.5. Another 
year later, 2.5. And so on. Substitution of these x-values into the L- 
P equation gives us a direct estimate of future retention (and by sub- 
traction, of future losses) in the manner described for LOCPRO^ earlier. 

The second estimating method we. use is for in-service accessions. In 
this case, since we cannot estimate their LOS, we must assume that their 
loss rates will be comparable to those of existing employees. Here, 
too, hpwever, we must also assume that these accessions wijl be spread 
over the whole year so that average service in this occupation is only 
0.5 year. Here, therefore, we estimate losses as equal to one-half the 



i/Note that if: the first- six-months loss rate for new hires differs 

substantially from that for in-service accessions, the final hires 
^needed" estimates will show a somewhat different ratio of inew hires 
^o total accessions than^was input into the model (e.g.. Figure 10-4). 
2^/lOS = Length Of Service ' 4 
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annual loss rate for the existing workers • Assuming the existiAg worker, 
loss rate is*14%» the number of new in-^service hires needed per hire on 
board at the end of .the year would be: . ^ ■ 

l.D - (0.14 X 0.5) » 1.0 - 0.07 « 0.93 
c \ 1.0/0.93 - 1.075 " 

Projecting future losses from this in-service accessions group is also 
straightforward. Since our starting assumption'was-that thi« group's 
loss rates were similar to those of retained .employees, we simply apply 
our rate estimates foif retained emplbyeies to this group without change. 

HIREST Requirements ^ » * , 

Since the above discussion giVes a »basic picture of HIREST's principal 
tfechniques and "assumptions, it is appropriate next to describe briefly 

^ a) What inputs HIREST requires ;„ 

- . b) What HIREST does;, and 

c) What it does not do. ■ 

HIREST's it^put requirements can be 3ummarized briefly. From the above 
discussion, it will be readily apparent what role each data item plays. 
Required inputs are: . , * ' 

1) An. employee data file Showing for each eirtployee in the ^ 

occupation (in the. following order): ^ i " ' 

- y • ' ^ ; 

DOB (Date of Birth) * 

EOD (Entry on Duty) date^/" 

Sex (1 = F, 2 = M) 

SCD (Service Computation Date)A./ 

2) T^ile "As-of" Date for employee data file 

3) The group's L-P loss equation*^.. , . ^ . 
A) Number of years projections wanted J ! - 

5) Percentage of total accessions who arQ new hires (i.e., 
new. Career-Conditional (appointmentsO 

6) Population estimates for the years covetj^d ^ 

Given these data, HIREST performs its calculations automatically and 
prints out. the results in summary tabulation form, a sample of which is « 
reproduced in Figures 10-^3 and 10-4. Note that these tabulations include: 



\) ' A summary of the group pc^ilation data which was input 
feariier. W 



3,/Date of first entry into service: The first entry on the" SF 7, Service 
Record Card, or in the Service Record File^ if automated. All dates in 
month and year only. fTg., 0253 (« 2/53>. ^ , ^ 

1/a sample line of file for a male, born 8/44 with an EOD date of 9/68 
.and an SCD date of 6/64 would be: JD84A096820664. 
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Figure 10-3 



tmiR 5- SPACE NAME GF . ' 
EMPLOYEE DATA FILE, 




MP322 :. 




ENTER NG. OF EMPLOYEES 
IN MP322 FILE 




112 




ENTER MP322 AS-OF (I.E., 

CURRENT) DATE IN MO., YR. (E.G. ,05,75) 




1.76 


y. 


ENTER A. 8 OF MP3 22 L-P EQUATION 




.A635.-1 .35693 . _ 




FILE MP322 READ. NEXT: 




ENTER NO.'*OF FISCAL YEARS 
PROJECTION WANTED (1-5): 





5_ 

ENTER MC, YEAR OF START 

OF FIRST FISCAL YEAR (F,G., -10,76) v .. 



7. 76 



ENTER MP322 POrULATION 
AT START OF f V 1 97 7 • 



120 

ENTER MP372 POPUi AT ION ' 
AT END or ry^r/Ni VFAO 



1 977i 
130 
1 978; 
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•HTER FRACTION OF MPJ22 TOTAL 

fliCCESSIONS WHO ARE NEW HIRES (E.G. ,0.25)5 

.5.0 ' 



PERI.DD: 

population: 

START 
END 



SUMf^ARY OF ESTIMATED LOSSES AND GAINS 



(BASE 



112. 
120. 



]977 



130. 



1978 



130. 
; 138. 



1979 



138. 
I'* 5. 



1980 



U5. 
136. 



EST. 'GAINS 

NEW HIRES 9. 

ACCESSIONS 7. 

TOT. GAINS 16. 



RUN AGAIN? (Y OR N) 

Tlop 



16. 
12. 

/8. 



1 5. 
1 2. 

27 



15. 
1 1 . 

26. 



5. 
A. 



'V 



1.981 



13 6. 
130. 



EST. LOSSES NO: 


% 


NO. 


z • 


NO. 




NC. 


% 


NO. 


% 


NO. 


% 


LOSS * 


7. 


5.S 


16. 


13.0 


17. 


12.8 


.17. 


12.'» 


16. 


V0..9 


]h. 


9.9 


DEATH • 


0. 


0.1 


0. 


cr. 2- 


0. 


0.2 


^ 0. 


0.2 


0. 


0.2 


0. 


0.2 


. ,DISA3. 




0.2 


1 . 


0.5 . 


1 . 


O.A 


1 . 


O.k 


1 . 


0.5 


1 . 


0.5 


kETIRE 


d.' 


O.J*. 


1 . 


l-o 


1 . 


0.8 


1 . 


0.7 


1 . 


0.8 


1 . 


1 .0 


TOTAL 


7. 


6.6 


1 £. 




19. 


1*4.3 


19. 


13.8 


18. 


12.^4 


16. 


11.7 


*(RIF) 


0. 




n . 




0 . 




0. 




.0. 




0. 





6. 
It. 

10. 



J'" 
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2) A suminary of loss estimates, by type of loss j with data 
ahowii as both nxunber and percent. 

3) A special detail line showing the number of lossesV if 
any, which are expected to be RIF's.^ 

4) A summary of gains estimates, broken down by New. Hires 
■ \ and In-Service Accessions. 

^ ^ ■ • 

On an overall basis, then, it may be said that what HIREST does is to \ 
perform t^ functions. First, it serves as a projection model by means ^ . 
of which personnel officials are able to make projections of occupational, 
firing on a multi-year basis. 

Second, It serves as a simulation model whose input variables can be 
delibexvately varied tq test the effects of input changes on occupational . 
hiring levels, turnover levels, etc. -One , can test the effects of specific / 
changes in population levels, for exanple, on the number pf turnover .losses, 
or RTF's, or >in-8ervice accessions. In this way, HIREST cian help advise^" 
management of some of the personnel implications of pxJoposed management 
actions before such actions are taken. '■■M- 

It should alscft be noted, however, that there are certain things that 
HIREST is not or does not do. First, it is not infallible. It provides 
a means of making "probable value" estimates. But these are only estimates.. 
They are not last-digit-accuracy predictions. There are too many uncertain- 
ties in jthe total process to make unqualified predictions a realistic pos- 
sibility. As one example, HIREST assumes that hiring will be evenly dis- 
tributed over each of the projection years and accordingly has estimated 
turnover among new hires using a 0.5 averaging factor. If real hiring fs 
concentrated early in the year (averaging factor, say, 0.7), however, or 
late in the year (factor of, say, Q.3), HIREST estimates will be signifi- 
cantly off target as a result. 

Second, and in a way related to the first point, HIREST is a stochastic 
or probabilistic model but it does not provide confidence-range estimates 
for its projections. In the program LOGPRO, for example, we provide limits 
above and below projected values within which actual values can be ^ected 
to fall in 95% pf possible ^ases- In HIREST, such 95%-conf idence limits 
are not provided. ^ _ > 

In part, this reflects the situation discussed first, above, that there 
are many unknown variables whose effects cannot be estimated beforehand. 
And in part, this reflects the technical problem that whereas we do have 
confidence-level data on L-P curve equations, we do not have such data 
^ for the actuarial'^^^fcables that make up a substantial part of our loss esti- 
mates. For both reasons, providing 95%-conf idence limits for HIREST pro- 
jectiohs is not feasible. 
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And l^ird) wihlle BIREST does prov;Lde estimates of both Inrpervlce a^d 
outQlde; hiring V 1^^ does iiot provide atly breakdown of hires by grade 
level. These vdll have to be estlinated by the analyst by etcher means. 

Cottclttslons 1 ' 

HIREST provides a very powerful and very flexible tool for staffing 
needs- analysis « Gbmblnlng two techniques of proven validity — actuarial 
technique and log-probablllty analysis— It enables personnel officials 
(a) to simulate for management the major turnover and hiring Implications 
of management's alternative workforce plans » and (b). to project the 
future turnover and hiring levels which may be expected under management' 
approved workforce plan. Thus> HIREST performs the central analytical 
tasks necessary for staff Ing needs plapnlng in the operational setting. 

'As with any projection model, jof course». HIREST projections are H)nly 
probable values — they are In no sense predictions possessing any last- 
digit accuracy. Also, HIREST projections retlect^ the basic assuntptlons 
tised In the model, such as hiring distributed evenly during projection 
years. To the extent that these assumptions are not borne out^ HIREST 
results may be affected accordingly. 

On the whole, however, HIREST can generate a great deal of very valuable 
management and personnel management information from relatively simple 
and straightforward elements. We believe that it is an effective first 
step toward the still more powerful and refined models of the future. 
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CHAPTER 11"^ 
- THE ROLE OF THE ANALYST 



When Abraham Lincoln delivered' his f amous "House Divided" speech iii: 
1858, it was at a time of unprecedented danger for the Union. The 
United States would Soon be engaged in a great civil war and the 
responsibilities facing the Nation's leaders were awesome and for- 
bidding. In the very opening lines of this speech, which is remembered 
as his most notable address before becoming President,' he summed up 
the Information needs of our leaders at that fateful hour in words of 
matchless precision: 

"If we could first ^ know where we are, and whither we 
are tending, we could better judge what to do^ anci how 
to do it . . . ." 

Every analyst should memorize these words. Not only because they 
express with absolute clarity the .^fprination needs of eyery decision- 
maker. But also because they show the ^way for every analyst, nd 
matter where located, to make a vital contribution to the effectiveness 
'of our system of government. 

You have seen in our earlier discussions how to perform the key analytic 
functions of staffing needs planning. You have seen the policies and 
functional provisions needed for effective staffing needs planning 
programs. You know what planning data are needed from management. You 
know how to analyze and project turnover. You know how to estimate 
future staffing needs. 

With this knowledge, you can now provide det is ion-makers with many types 
of information which they vitally need to djo their jobs better. If the 
manager proposes a future course of action requiring a workforce struc- 
ture that cannot be staffed, you can now say so. If there are workforce 
plan changes which could be made and which would make staffing feasible, 
you can make known those needed changes. If the needed workforce cannot 
be delivered at the cost specified, you can provide better estimates. 
And, if the required workforce cannot be delivered on the sctiedule 
necessary under man^^g*=^inent: ' s program plan, you can inform the manager of 
this. 

These are information items that the manager urgently needs to be sure 
that the plans being made, or the alternative program propo^fls being 
.-^jtiv-^j 4*, ff,r>r -roal a-nA viflhlp — nlans which in actual : fact can hi 



at tne pxans oejuig mduc, wj. ui*^ orj-u^^^^^^-w t^-w^ ^^^^^^ 

i^hed, are in fact real and viable—plans which in actual : fact can be 
Tried oiit^ ' 

r V 

If, because these data were not provided,, the manager submits to top 
executives, or to legislative bodies, program plans or decision alter- 



c 



natives which in actual fact: 



tCannot be carried out at all because the 'needed personnel are 
not available; Qr which 



^ cannot be carried out within the coist levels specified; or 

which , : 

' . '■ ■. ■ . * ■ . 

- catuiot be carried out on schedule because needed hiring and/br 
training cannot be completed in time; 

then because of the failure of the analyst, executive and legislative*^ 
actions will be taken and program performance promises will be made that 
the passage of time will show were seriously in error. 

In such circumstances, the work of executive and legislative decision- 
makers can become no more than guesswork, ^d the performance /promises of 
government can become literally incredible to the very public .that govern- 
ment exists to serve. 

. ■ " '■ 

Such failures of analysis as these can thus contribute in no small 
measure to the severe impairment of public confidence in the word and 
workings of government. • i& 

Clearly, as we said in the beginning, major Improvements in analytica.1 f 
methods and techniques are urgently and vitally needed. And the further 
^we progress toward truly multi-year planning, the more Intense this nee*d 
becomes. 

Everyone's contribution is needed in this improvement effort. The 
techniques described in this handbook, we believe, represent one major 
step forward. But they are not by any means the last possible word. 

On the contrary, they will in their turn be supplemented and eventually 
supplanted by other, still more effective techniques. Perhaps some of 
the readers of this handbook may be given or may take the opportunity to 
contribute to this progress. If you do get such an opportunity, we hope 
you will take it and that you will give it your best effort. 

■} 

Every improvement that can be made in staffing needs planning — or 
indeed in the whole manpower planning process — is an improvement in the 
ability of government managers — and thus, of our executive and legisla- 
tive officials — to tnake public decisions more effectively. 

Every improvement in analysis, then, enhances the ability of our govern- 
mental system to be responsive to the JLg^prmed wishes and choices of the 
, people at large. 

And that, ultimately, is what the job of^-^he analyst — and indeed that 
of every public servant — is all about. 
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- MANUAL CALCULATION OF , THE LOGrPROfBABILITY CURVE 



Probl em S 

Determine the A and 8 values of a log-probabi 1 I ty retentl on eqgatfon 
'usi ng: 

- Manual least squares techniques 

' and » 

- Normal curve area and logarithm conversion tables, 

1. • 

Data Collection s 

RetehtM on data" are obtained by determining the number or percent of 
employees retained from a given group of hires after specified lengths 
of service. Ideally, a group should be composed of employees. In the 
sarpdioccupat ion. However, If the number of employees in one or more 
occupations Is small, then it Is possible to combine 1 i ke occupat I ons. 

To analyze a ^.Iven group of employees, two types of Information must 
be recorded- 

(1) The number of employees in the original group 
of hi res ; ancf 

(2> The number (orvPercent) of these employees re- ^ 
tained after given lengths of time. 

The "griglnal group" is composed of employees hired during a specific 
time span (one year or less). The "given lengths of time" consist of 
a standard time unit and, in most cases, an averaging factor which 
approxlinates the actual average length of service of the original group 
at the end of the hiring span. 



Sample Data * 

\ The following is a sample* of retention data which will be used in 

the calculation*; below: 

Number hired I orrnp^^t- 1 on V * fiscal Year 1970 = 250 



At end Number Percent 

of FY I R e t a i ned Retai ned 

71 208 83.2% 

72 189 75.6% 

73 1 76 70.4% 

74 164 • 65.6% 



The averaging factor used will be 0.5 years (whi ch assumes an even 
,dl str I but ion of hiring during a fiscal year). Thus, the value for 
FY 71 Is 1.5' years, for FY 72 is 2.5. etc. 
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- Page 2 

the Equation s 

The mathematical form of a line Is given by the equation^ 

Y = A ^ fix 

Retention data (also- called log-probability data) c^n be expressed 
linearly by making two transforfnat Ions: 

(1) Percent of employees retained Is transformed Into 
number of standard deviations from th e mean of a 
normal curve ; and ♦ . 

(2) Years of service completed Is transformed Into the 
logarithm of years of service completed . 

These two transformations can be made using the two tables I ncluded 
In this instructions 

(1) The Normal Curve Area Conversion Table 

(2) The Table of Logarithms 



Computat i ons • ^ 

To solve the stated problem It is necessary tc^ transform the retention 
data into their linear form and solve for the coefficients A and B. 

To begin, let* 

N = Number of retention points (i.e., number of observations) 
I = Retention poin^, i = l,2,...,N 

tj - Length of service represented by retention point \ 
t\ = Base 10 loqarUhm of tj 

P(tj) " Perrppf of rhe orlqinal group remaining at time tj 

- Number of standard d*-viations from the mean of a normal 
curvp roprp*?*»ntpH by the v^liie Pttj) 

This instruction^ will romptjto A ;^nH B using the sample data and- 

Case I 2 Two ye^r s of ret**nrion data 

Case IP Threp or more yf>^rs of retention data 

Case I If only two years of retention data (or two retention points) 
are avai lable, then the*A and, B values of the I dg-probabi 1 1 1 y equation 
may be computed using the following formulas^ 
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B =^ Y, Y2 (1) ■ Page 3 



.i ■ . 

A = Y, - BX, w ' (2) 



Using the\jL.I.,rst two points of the sample data^ . ^ 

t, = 1.5 P(t, ) = 83.2% ' 
- t2 = 2.5 P:(t2) = 75.6% ^ 

From the Table of. l^ogarlthms, we obtain- 

X. = 0. 17609 
■ X2 = 0.3979^ . 

And from the Normal Curve Area Conversion table and Interpolation (see 
Page 7): 

Y, = 0.96223 
. ^ Y2 = 0.69358 

Substituting these values Into formulas (1) and (2), wie gets 

B = 0.96223 - 0.69358 . ' 

0. 17609 - .0.39794 

. = -1 .210953 

And, 

A = 0.96223 - (-1 .210953)(0. 17609) 
= 1.175'*67 '. 
Thus, the.^ log-probabi 1 I ty retention equation In this case IsS 

Y = 1.175'*67 - 1.210953X 

Case- 1 1 . If there ace three or more years of retention data available 
the A and B values of the 1 og-probabi 1 I t.y equation may be calculated 
In one of two ways! 

(1) Simultaneous solution of the "normal equations"; or 

(2) Substitution of values Into the "linear fit algorithm". 

.•// . . ■ .. . . 

?hese methods are both variations of the "least squares" techni que. 

Whichever method Is chosen, several quantities must be calculated. 
For this purpose. It Is useful to set up a work table In the follow- 
ing form (using allthe sample data points)^ 
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At:'!'- •'■•;:Vvi';- 



Year 



Log 
Year* 



Percent 
ReiDalnlng 



Stan. Dev. - 
From Mdlln** 



XixYi 



Page 4 



1 .5 
. 2.5 
3.5 

,A;5 



0,17609 
0.39794 
0.54407 



%.2 
75.6 
70.4 
65.6 



0.96223f 
0.693^ 
0.535f9 
0.401 60 
2.59340 



0.16944 
0,27600 
0.29162 
0.26233 
0.99939 



Using Table of Logarithms 

Using Normal Curve Area Conver si orv^^b I e and 
interpolation 



0. 03101 
0. 15836 
0.29601 
0.42668 
0.91206 



1 1 - 1 . The normal equations areS 

SY = NA + 82X . (3) 

2XY = AXX + 82X^ (4) 
.• rom the above work table, we haveS . • 

■ ^' *^ SX = 1 .77131 

SY = 2.59340 

2XY = 0.99939 

1X2( = 0.91206 

• . N = 4 . 

Substituting these values into equations (3) and (4), we obtain: 
/- 



# 



2.59340 ■= 4A + 1.771318 {3a) 
0.99939 = 1 .77l:31A + 0.912068 (4a) 



Solving these equations simultaneously' (see Page 8 ) , we have! 

A = 1.165275 
• 8 = -1. 167328 

And the log-probabl 11 ty equation ts^ 

Y = 1 . 16S775 - 1 . 167328X r 



M - 2 . The linear fit algorithm is' 

8 = N£XY - 2XSY 

. . N2X2 - (2X)2 

. A = SY - 8IX 



N 
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(5) 
(6) 



;Usln.g.tt(e va1'ue$ from the work table, we. get J r ^. 

l^vjV- V • . .b^ >(0. 99939) - (1 .77131 )(2.59340) 

P-.,./^-:,/, " . A(-0. 91206) - (1 .77131)2 v.; 

>■ '". ■' V. • ■= '- 0.596155 ■ ■'■ . 

• . 0.510701 

' . = -1.167328 



A = 2. 593^*0 - (-1.l67328)( 1 .77l3t) 

5 ■ 



And, 



./ - 1. 1652 75 

■ ■ .V . ■ 

Thus, the log- prpbabi 1 J ty equatFon is^ ' * . 

Y = K165275 - 1.167328X- (7J ' , 

Iteration and" Project i on : 

The processes of iteration and projection are conceptually the .same. 
r>They both involve the substitution of X-values into a given equation to 

obtain the Y-valu.es given by the* line (Y"). The on I y d i f f erence be- 
^tween the two I s ^n the X-values which^re used, ^ 

In iteration, previously-observed (or'past) X-values are used to ubtai n 
the equation values. For example, using equation (7) and the first X- 
value in the sample data (i.e., X] = 0.1 7609) ^we obt.aJ^nS 

.-. YT = 1.165275- 1.167328(0.17609) 

P = 0.95972 . 

Using the Normal Curve Area Conversion Table and i nter po ^t i on , this 
value may be converted to percent. In this case, the percent value Is 
83. U%. . - 

Iterated values are used In the calculation of standard deviation and 
variance and 'other goodness of. f i t'measu res . 
• » 
For projection, future X-vdlues are used to obtain estimates of the 
number retained from an oriainal group at future points In time. For 
example, using equation (7) and year ^,5, we haveT 

t - 5-5 

V X = 0.7<*036 ' — . ■■ 



And, 



Y' = 1.165275 - 1.167328(0.7^036) 
r 0.30103 (or 61.83%) 



Projected values may be used in planning for f uture . hi r I ng needs. 
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i rage D*^ 

Standard Deviation s • . ^ ■ y 

To ca leu Tate a st-andard deviation from the >log-probabM I ty line of r e- 
gresslbn ^t"I$ necessary to (a) Itef?ate the regression equat I an f<>r .a 11 
the past values of X and (b) determltie the dl/fferences between the ac- 
tual Y-values and the curve values tY"). m js^ can be done using the 
following work table and the sample datas . 



Year 


Actual 


■ 


Curve 


UlTTm 




2 


(tl) 


(Yi) . 




(YT) > 


(D=^Y|-YT) 




(d2) 


1.5 
2.5 
3.5 


0.96223 
0.6935a 
0.53599 
•O.ijOl 60 




0.95972, 
0.70075 
0.53017 
0.40276 


0.002.51 
-0.0071 7- 

0.00582 
-0.001 16 


0. 
0. 

a. 
0. 


0000063 
00005U 
0000339 . 
0000013 


The formula for the s 


tandard , 


dev 1 at i on i s 2 • 
1 






















» 






1) 








In this 


examp 1 e. 













XD^ = 0.0000929 



And, 



0.0000929 - 0.00^565 



Note thic: v::*tu*^ ^ ^> r:^r,'*H.-^rr) d^viatFon<: from th*> m^rin of the 

norma 1 cur vp . 



Confidence Interval fbr Project I pns = 

In addition to simply proJe^ti'>g a 1 og- pr obab I I i t y equation for future 
points, it is ^Iso po^^ible to calculate ^ 95"/- confidence range for 
each pro ject *»d n'ol r>t . T hi » Hor^o us I mo t hf> r ^n<i^t''<\ devl at I on. To 
begi n , 1 et • 

5 - Standard deviation from t Hp ) og- pr obabi H ty j t ne 

Y'^ = AproJectedcurvevalue 
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H = 2s 

^ L = - 2s 

Both H and L are^In the form standard deviations from the mean of the 
normal curve. Using the Normal Curve Area Conversion Table, these 
values can be converted to percents' and these percents will constitute 
the confidence range. To convert to numerical values, simply multiply, 
each percent (In decimal form) by the number In the original starting 
groups 

For example, using equation (7) and t = 5.5 years, then:. 

Y"' = 0.30103 

And, ^ 

• Ik. 

s = 0.005565 • - 

2s = 0.01 1 1 30 

Thus, - ^ 

H ~ 0.30103 ^ 0,01113 = 0.31216 

L - 0.30103 - 0-01113 = 0.28990 

Using the Normal Curve Area Conversion Table and interpolation, H and L 
convert to 62.25% and 61.^1%, respectively. The percent vaVue assocf- 
-ated with Y" Is 61.83% (tbls Is also known as the. "expected va 1 ue" ) . 
Numerically, the range would be (with 250 In the starting group): 

HI gh: .6225 x ^50 - 1 55.6 

Expected Valued .6183 x 250 = 15^.6 

Low:' ,61^1 X 250 - 153.5 

For planning purposes, these figures rnay be rounded. ^ 



Li near Interpol at 1 on : 

The process of linear Interpolation Is used to read between the lines 
of a statistical table; I.e., to calculate intermediate values. 

In general, suppose that value is represented In a. table by value T] 
and is represented by T^- What is the T-value associated with the 
value V (V2<V<V])? The value of T can bie calculated from the following 
rat I o: ' ' \ 

V 7 V2 T - T2 ♦ \ 



V, . V2 



Tl - T2 
. 163 . 
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Solving this equat ton for T. we get: 

/ V - V2 \ 

T=T2*(_ \(Ti - T2) (8) 

. \Vi - V?/ 

This process tan be used In both of the tables attached to this I nstruc- 
t Ion, ^ 
F-r example, usl^g the sampU data and the Normal Curve Area Conversion 
Tabltrwhat Is the standard deviation value associ atea ,wl th .83.2%,? In 
this case, 

• ' V = 83.2% J = , ^ , , 

Vi = 84.0% Tl = 0.99'»'»6 , . , 

. . V2 = 83.0% T2,= 0.3Si*\7 

Using equation (8). /I'i 

T = 0.95417 * / 83.2 - 83 .o\ ( 0 . 99446 - 0.95417) tV -'r 
\^64.o - 83.0/ 



= 0.954)7 (0.2)(0. 04029) 1 
= 0.96223 

T-hus. the table value associated with 83.2% Is 0.96223. 

Interoolatlon may also be used fn the other direction; I.e.. with. the 
lund^i devUUons columns representing the V-values and the percent 
remaining columns the T-vaJues. 

i« i-h« fir.;t 1 terate<? value from the 1 eg- probabi 1 1 1 y' equat I on 
For example, the first 1 ter acea va 1 uc . ,ocorIated? In this case. 

Is 0.95972. With what percent value Is this associ atear m cms 

V = 0.96972 T = ? 
Vi = 0.99446 • Tl = 84.0% 



And, 



V2 = 0.95417 T^ = 83.0% 

N % 

T . 83.0 * / n. 95972 - 0.9541 7 \ (84.0 - 83.0) 
^0. 99446 - 0.9541 / I 



= 83.0 ♦ 0. 14 
= 83. 14% 

■ 4 



Simultaneous Equations - 

are of the form^ ^ ^ ' - 



- ■ , Page 9 

S,. = Q,A + Q28 (9) 

S2 = RlA + (10) . 

where S^, S2» Ci'ij;v.^Q2» ^it and«R2 are all known quantities. ' 

The" techni que for solvi ng two simultaneous equations In two unknowns 
can be shown using the normal equations%in this instructfon as an 
example. These equations are* \^ 

"2^593^0 = 1,771318 (3a) 

0.99939 = 1 .77131 A 0.91206B (^a) 

The first step In solving these equations is to* e 1 1 ml nate one of the 
unknown values (A or B) from them. If A is selected for elimination, 
then each term In equation (3a) is multiplied by the coefficient of 
A in equation (i^a) ('I.e., 1.77131) and each term In equat I on: '(4a) Is 
mul.tlplled by the. coef f i ci ent of A in equation (3a) (I.e., 4). After 
these multiplications have been comp 1 eted the. two normal equations 
now 1 00k 1 1 ke t hi s 2 

4.593715 - /,085240A + 3.1375396 -(3b) 
3.997560 = 7.085240A + 3.6482408 •(4b) 

The next step Is to subtract one of these equations from the other to 
get one equation aaontal nl ng only one unknown (since the A terms will 
cancel out). If equation (^b) lb subtracted from equation (3b) the 
r esu i t I s 2 

0.596155 - U.510/01B 
Solving this equation for B glves^ 

B ^ 0.. 596155 
-0:"510701 

^ 1 . 16/328 

This calculated value of b can now be substituted back into one of the 
two original equations to yet a value for A. If equation (3a) is se- 
lected, the result would be: 

2.59iAO -.AA + 1 .77131 (-1. 167328) 

And, 

' A = I . 1652/5 * * 

It is possible to check these results by subst i tut I ng the calcuilated 
values for A and 8 Into equation (4a). This would glve^ a 

0.99939 = 1. 77131(1.165275) ^ 0 . 9 1 206 ( - 1 . 1 67328 )\ 
= 2.064063 - 1 .064673 
= 0.99939 
^ 165 - 
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-TABLE OF LOGARITHMS- 



YEAR LOG (10) 



0.1 
0.2 
0.3 
O.k 
0.5 
0.6 
0.7 
0.8 
0.9 

■4^ 



-1.00000 
-0.69897 
-0.52288 
-0.3979M 
-0.30103 
-0.22185 
-0.15*90 
-0.09691 
-0.0*576 
0.00000 



YEAR LOG (10) 



5.1 
5.2 
5.3 
5.* 
5.5 
5.6 



0.70>57 

o.moo 

0.72*28 
0.73239 
0.7*036 
0.7*819 
0.75587 
0.763*3 
0.77085 
0.77815 



YEAR LOG (10) 



10.1 

10. 2r 

10.3 

10.* 

10.5 

10.6 

10.7 

10.8 

10.9 

11.0 



1.00*32 
1.00660 
1.0128* 
1 .01703 
1.02119 
1.0253t 
1.02938 
1.033*2 
1.037*3 
_LiO*li2- 



1.6*532 
1.0*922 
1-05308 
1.05690 
1.06070 
1.06**6 
1 .06819 
1.07188 
1.07555 
1-07918 



r.i 

1.2 
1.3 
1.* 
1.5 
1.6 
1.7 
!.d 
1.9 
2.0 



0.041139 
0.07918 
0.1139* 
0.1*613 
0.17609 
G.20<t12 
0.230*5 
0.25527 
0.27875 
0.30103 



6.2 



6.8 
6.9 

J*0_ 



0.79239 
0.7993* 
0.80618 
0.81291 
0.8195* 
0.82607 
0.83251 
0.83885 
0.8*510 



11.1 



11.6 
11.7 
11.8 
11.9 
12.0 



271 
2.2 
2.3 
2.* 
2.5 
2.6 
2.7 
2.8 
2.9 
3.0 



0.32222 
0.3*2*2 
0.36173 
0.38021 
0.3979* 
0.*1*97 
0.*3136 
0.**716 
0.*62*0 
0.*7712 



7.1 
7.2 
7.3 
7.* 
7.5 
7.6 
7.7 
7.8 
7.9 
6.0 



0.85126 
0.85733 
0.66332 
0.86923 
0.87506 
0.88081 
0.886*9 
0.89209 
0.89763 
0.90309 



12.1 
12.2 
12.3 
12.* 
12.5 
12.6 
12.7 
12.8 
12.9 
13.0 



1.08279 
1 .08636 
1.08991 
1.093*2 
1.09691 
1.10037 
1.10380 
1 .10721 
1.11059 
UM39* 



3.1 
3.2 
3.3 
3.* 
3.5 
3.6 
3.7 
3.6 
3.9 

*.o 



0.*9i36 
0.50515 
0.51851 
0.531*8 
0.5**07 
0.55630 
0.56820 
0.57978 
0.59106 
0.60206 



XI- 

6.2 
6.3 
6.* 
6.5 
6.6 
6.7 
6.8 
6.9 
9.0 



^.908*9 
0.91381' 
0.91908 
0.92*28 
0.929*2 
0.93*50 
0.93952 
0.9***8 
0.9*939 
0.95*2* 



13.1 
13.2 
13.3 
13.* 
13.5 
13.6 
13.7 
13.8 
13.9 
1*.0 



1.11727 
1.12057 
1.12385 
1.12710 
1.13033 
1.1335* 
1.13672 
1.13988 
1.1*301 
1.1*613 



1. 1*^22 
1.15229 
1.1553* 
1.15836 
1.16137 
1.16*35 
1.16732 
1.17026 
1.17319 
1>«7609 



*.1 
*.2 
*.3 
*.* 
*.5 
*.6 
*.7 
*.8 
*.9 
5.0 



0.61276 
0.62325 
0.633*7 
0.6*3*5 
0.65321 
0.66276 
0.67210 
0.6812* 
0.69020 
0,69897 



9.1 
9.2 
9.3 
9.* 
9.5 
9.6 
9.7 
9.8 
9.9 
10.0 



0.95901* 
0.96379 
0.966*8 
0.97313 
0.97772 
0.98227 
0.98677 
0.99123 
0.9956* 
1.00000 



1*.2 
1*.3 
1*.* 
1*.5 
1*.6 
1*.7 
1*.6 
1*.9 
15.0 



/ 
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•NORMAL CURVE ARE/^ CONVER'SION TABLE- 



ERIC 



PERCENT 
REMAINING 



STANDARD STANDARD STANDARD 

DEVIATIONS PERCENT DEVIATIONS PERCENT DEVIATIONS 
FROM MEAN REMAINING FROM MEAN REMAINING FROM- MEAN 



95 


1 .6kk8S 


65 


0 .38^3 2 


35 


9'» 


■1 . 55*4 77 


6*4 


0. 358A«6- 


3*4 


93' 


1 .'4 7579 


63 


0.33185 


3 3 


92 \ 


1.. 140507 


62 


0. 305*48 


32 


91 


1 . 3*4076 


6 1 


0.27932 


J 1 


90 


1 .281 55 


60 


0 . 2 5 3 5*5 


7 n 
3U 


89 


1 . 22653 


59 


0 . 227 55 


0 Q 
Z 7 


88 


1 . 1 7*499 


58 


0.20189 


2 0 


87 


1 . 12639 




n 1 7 A 7 


t / 


86 


1 .08032 


56 


•0.1 5097 


26 


85 


1 .03614 3 




0. 1 2566 


25 


6k 


0.99*4 '46 


5*4 


0. 1 00*43 


2*4 


83 


0.95*41 7 


53 


0.07527 


23 


82 


0.91537 


52 


0.05015 


22 


81 


0.87790 


51 


0.02507 


21 


86^ 


0.8*4 162 




0.00000 


20 


79 


0. 806*42 


'*49 


-0.02507 


1 9 


78 


0.77219 


*48 


-0.0501 5 


1 8 


77 


0.73885 


*47 


-0.07527 


1 7 


76 


0. 70630 


*46 


-0. 100*43 


16 



■0. 3 853.2 • 
.0l*4l2*46 
■'0**4 3 99 1 
-0.*46770 
-0.*49565 



•Q..52*4*tO 
■0.55338 
-0. 5828*4 
-0.61281 
•.0.6'»335 



■0.67*4*49 
■0.70630 
■0.73885 
■0.77219 
■0. 806*42 



-0.8*4162 
-0.87790 
-0.91537 
-0.9.5*4 1 7 
-0.99*4^6 



.75 

7*4 

73 
72 
71 



0.67*4*49 
0 . 6*4 33 5 
0.61281 
0.5828*4 
0.55338 



*4*4 
*4 3 

'•2 
11 



12566 
1 5097 
17637 
201 89 



-0. 2275*4 



15 

1 *4 

1 3 
1 2 
11 



- 1 . 036*4 3 
'-1 .08032 
-1 . 1 2639 
-1 . 17*499 
-1 .22653 



70 
69 
68 
67 
66 



0>^^0 
0.*4 9?8^ 
0.*46770 
0.*4 3991 
0.*4 1 2*46 



39 
38 
37 
36 



■0.25335 
■0.27932 

■ 0. 305*48 

•0.33185 
-0. 358*46 



10 
9 
8 
7 
6 
5 



1 .281 55 
1 . 3*4076 
1 .*40507 
1 .^7579 
1 .55*477 
1 .6*4*485 
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APPENDIX B 

SOMfc APPLICATIONS OF THE 
L0G-PR0dA3ILI TY EQUATION 
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Question^ What is the probability that an employee who has already had 
I t years of Federal service will have at least one more? 

: Suppose that the 1 og-pro^abl 1 I ty equation for a' given employee"*s oc- 
pat I on i s i 

Y = 0.953^ - 0.9l02Log(t) 



And suppose that , 



Then , 



t = the empl oyee's length of service 
(prefer ably based on original en- 
try on duty date) 
= 3.75 years 

t%l r the employee's length of service 
plus one year 
- ^.7S years 



Y = 0.9S3'* " 0.9l02Log(t) 

= 0.953'* - 0.9102Log(3.75) 

- 0.953^ - 0. 9102^0. 57403) 

= 0.i»3092 



Converting Y^ (which Is in standai d deviation units) to percent gives* 



And, 



- 0.666/ 



Yj-^l - 0.95)3'* - 0.9102Log(t+l ) 

= 0.9534 - 0.9102Log(4.75) 

= 0.9534 - 0.9102(0.67669) 

= 0.3374b 

Converting Y^^| to percent gives^ 

The retention probability is ylven by the formula^ 



In this casei 



R - ,632 1 - 0.94 81 
. .6667 



Thus, the probability that an employee who has served 3.75 years will 
serve 4.75 years is 0.9481*. 
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Quest I on^ 
2 



Whdt Is the probaiblllty that an accession to your organ- 
ization wM I stay wl th you for at least two years If (a) 
the accession Is a new outside hire and (b) the accession 
Is a reassigned employee who has a I ready been with your 
organization for 3 years? 



Suppose that the 1 og-probabl I I ty equation for the occupat I on- I n 
which a vacancy Is to be filled Is- 



Y = 



0.9534 - 0.9102Log(t), 



Case a. 



Suppose that this vacancy Is filled by a new outsl de,. hi re. 

The probability that a new hire will be with the or ganl zat I on I n 
two years Is found by Iterating the log-probabl I I ty equation for 
the t-value of 2. This glves^ 

i^Y = 0.953'* - 0.9l02Log(2) 

= 0.9534 -'O..9:r02(0.30103) 

= 0.67940 

This Y-value transforms to a probability value, of 0.7516. 



Thus^ for thl$ occupation, there Is a 75% chance 
will stay with the Job for at least 2 years. 



that a new. hi re 



Case b, 



Suppose that theivacancy Is- filled by a reassigned employee 

who has already been with the organization for 3 y^rs. The proba- 
bility that this employee will stay for at least tUo more years Is 
found by calculating the quotient of t he probabi VVtV-€<^^stay i ng 5 
(3 + 2; years and the probab i I i ty of staying 3 years. ThT^s Involves 
two iterations of the log-probability equation- 



Y^3 = 0.9534 - 
= U.9534 • 
= 0.51913 



0.9102Log(3) 
0.9102(0.47712) 



This converts to a probability value of^ 

p^3 = 0.6982 



And. 



Y+c = 0.953.4 - 0.9102^.og(5) 

= 0.9534 - 0.9102(0.69897) 
= 0.31720 



This converts to a probability value of 

• = 0.6245 

' ■ ' - 172 - 
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Then the probability that an employee with 3 years of service will 
stay at least 5 years IsJ 

Using the calculated probability values gives: 

0.6245 = 0.8944 ' 
0.6982 

Thus, for this occupation, there Is an 89% chance that an emploilee' 
wlth 3 years of service will stay 2 more. In other words. 9 times _ 
out of 10 a selection of someone with 3, years of service win result 
In that employee staying with the new Job for at least 2 years. 

The Important point to note from this example Is the f ference In 
reteritlon probabilities which Is caused snlftlv by d fferent methods 
of selection. In this example we used a log-probabi I t ty equation 
which characterizes a professional occupation. However, the d ffer- 
erJces noted here are even more pronounced In clerical occupations 
wHere there can be a 10 to 1 retention dl^fference between selecting 
an employee already on board or hiring frpm the outside. 

V 

\ . 

( 

i 

L 



-173 - 



0" 



. , . . • Page >* , 

Question* How can Individual retention probabi 1 1 tl es f or employees l-n 
a given group be translated Into the estimated number who^ 
win be lost from the group? — 



Suppose that the 1 og-probabl l I ty equation for the glyen^roup i s: 

Y = 0.3S3k - 0.9l02Log(t) V ■ ' 

And suppose that the given group consists of 10 enVploye'^ wi^ose 
length of service composition and"! ndl vl dua 1 retention probablll 
(see previous question) are given by the following table- 





-Base 


Year- 


Base 


year + 1 


RetentI on 


Empl oyee 




Ret 




Ret 


Prob. 


Number • 


L.O.S. 


% 


L.O.S 


% . 


(Rl = 


(1 ) 


(t) 


(P,) 


(t+1) 


(Pt.l^ 


Pt+l/Pt) ■ 




1.2 


.8109 


2.2 


.7395 


.9119 


2 


3.5 


.6766 


k.S 


.6'tOI 


.9'»61 


3 


0.8 


.8512 


l.Q 


.76'»6 


.8983 




2.k 


.7282 


iA 


.6807 


.93'»8 


P 5 


1.8 


.76'»6 


2.8 


. -. 7076 


.9255 


' 6 


'».3 


.6'»68 


5.3 


ifel57. 


.9519 


7 


6.7 


.5798 


7.7 


.5.582 


.9627 


8 


5.0 


.62'»5 


6.0 


.5968 


.9556 


9 


2.1 


.7'»5'» 


3.1 


.6936 


.9305 


10 


1.6 


.7786 


2.6 


.71 76 


.9217 




/The number lost from this group from (base year) to (base year 1) 
is given bys 

N 

L N - 

1 = 1 



From the above table. 



N = the number of employeeS^n the grouf) 

= 10 ... 



N 



I:Ri = 9.339 
i = l 



r,. 



And, 

L = 10 - 9.339 
=0.661 

This value would round to 1. Thus, 1 employee will be lost from the 
group. There Is no way to determine which one It will be. 

- 174 - 



174 



Page 5 

Question; How many cl vM engi neers must be hired by the organization 
^ In this fiscal year In ord^r to have 1 00 of them on board 

five years hence? ' * 

Assume that the log-probability equation for civil engineers Is* 

Y= 0.953^ - 0.9102Log(t) (1) 

The t-value for base year + 5 Is 5.5 years (assuming an ev^en distri- 
bution of hires over a fiscal year). ' 

The first step Is tOr calculate the "percetit remaining" value given 
by equation (I) for 5.5 years. Solving the equation for the Y-value 
associ ated vl th t = 5.5 results \n* 

Y+5 = 0.953^ - 0.9f02 Log(t) 

= 0.9534 - 0.9102(0.74036) 
= 0.27952 

This value Is in standard deviation units. Converting to percent 
givess 



P^^ = 0.6101 



(or 61.01%) 



Thus 61.01% of the starting group hired during the current fiscal 
year will still be on board In five years. ^ 

To determine the number of civil engineers to be hirecf this fiscal 
ar , I et 2 

Nq. - the number to be hired In the 

base year ^ , 

N^5 = the number wanted on board I n 
f I ve years 



Then, 



No ' N^5 



^^5 

100 
.6101 



= 164 



Thus, 164 civil engineers must be hired this fiscal ^^ear |o that 100^ 
of them will still be on board in five years. 
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Questions What Is the threeryear return on a training i nvestment '( A) 

when only new^outslde hires are trained and (B) when, employ- 
^ ees who have already been on board for three years ar^ 

tr al ned? . 

. • ■ ■ ' , . 

Suppo5^e that an organization wishes to train 100 personnel manage- 
ment specialists and that the blllned cost of the training program 
Is $1000 per employee trained. And suppose that the Jog-probabI 1- 
Ity equation for perSjOnnel management specialists is^ 

Y = 1.165 - l.l67Log(t) (1) 

Each of the cases described In the question can be explained by 
calculating two figureis^ 

• ) ■ 

1. The number of employees from the starting group 
of 100 who will be retained by the organization 
three years after the completion of the training 
course; and 

2. The actual cost of the training program per re- 
tained employee. 

Case A - Suppose that all 100 etnfJloyees In the training program are 
selected from a group of new outside hires. ^ 

Three years after the training program, those employees sXl 1 1 on 
boand will have an average length of service of 3.5 years (assuming 
. an even distribution of hiring during a year). 

The Y-value associated with this value of t (i.e., 3.5 years) Is 
given by" 

Y^. = 1 • 165 - 1. l67Log(3.5) 
= ) . 165 - 1. 167(0.54407) 
. ^ - 0.53007 

This value is in standard deviation units. Converting It to percent 
r esu 1 t s i n2 ' 

P^3 = 0, 7020 (or 70.20%) 

Thus, the number of employees remalnPng from the original group of 
100 Is: . 

- 0. 7020 X TOO = 70X ' 

To translate this number Into an actual cost per employee trained 
. f i gure, let: " . 

TC = the original stated cost per 
employee of the training 

- 176 - . ^ 



Then, 



AC 



» ' Page 
the actual cost of the training 
program per employee sti I I on 
board three ye^rs later 



AC = TO 



> 



c $1000 
7020 



= $ I ^25/empl oyee tral ned 



*Thus/ the three-tye^r return on the training Investment when only new 
outside hires are trained Is $1425 per employee trained and still on 
board. v 



CaTse B - Suppose that all 100 employe/es In the tral nlng program are 
eropi ayees who have already served three year's with the organization, 

TJnree years after training, those trafn^^ employees still with the 
organization will have an average length of service of 6,5 years.' 
• The Y-value assocl ated wl t h th^ value>iOf t Is given by! 

^ = 1.165- 1 . l67Log(.6.5) 
= I . 165 - 1 .167(0.81291 ) 
0.2,1633 



Converting this va 1 ue to ' per cent glvesi^ ^ ^ 

* \ " . ' / • 

P+6 = 0.5856 ' ior 58.56%) 

In addition, since the trai nees had already been on board three years 
at the time of the program, thieir retention probabMlty from 3 to 6 
years Is given by* > 



P = P + 6 



.-1 f 




original group of 100 



> ^ •■ . • ■ t ^ » 

=. 0.5856 =iO.J^k2 
. •' > 0.7020 

Thus, the, number o§ employees ren^ajn 
tral nees I s' 

N+6 = P, X 100 . Y- 

= 0.83^2 X 100 = 83 ) 

To translate this number Into an actual cost figure, use t he ' f ormu I a*' 

/■ 

'V " - 177 - 
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Thus t 



AC = $1000 = $n99/employee tralncfdr* 



Thus, the three-year return an the training Investmerit wheft'CM 
who have already served three years . are tral ned Is $1199 per employee 
trained and still on board. \ ^ 




Conclusl on - The return on a^^l^r c^l nlng Investment can cvary significantly 
based on the selection of employees to trained, Mar|Jagement should 
be^made-aw^re of the. <:bst differences Involved so they can make an I nr 
formed selection decl sion . ^ 

■ • •■ V ■ »^ ■ . ..... ■ 



/ 



* 
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Questton^ What are th^ group retention probabM I t I es for employees 
^ In each length of service category? 

•Suppose that the log-probabi 1 t ty equation for a given occupation 

f '* Y ^ 0.953^ - 0.9102Log{t) 

To answer the question we need to ca 1 cu 1 ate group retent I on proba- 
abilities; i-re., the probabilities t^hat employees- In given length 
of service cat^gorj^s dur I a given base year will still be bn 
board at the end of thp next year. 

FI r St 'cons I der thosfe employees In their first year of service 
I.e., from t day to 1 year --during the ba3e year. Jt Is^a simple 
matter to determine their group retention probability?. Just calcu- 
late the Y-value associated with the t-value of 1.5 years. .(This 
Is the same as c^ 1 cu 1 at I ng ,t he retention from an or i gfnal group 
one year, later.) In this case we have- 

. ■ ' Y r 0.953^.- a.9102Log( 1 .5) 

^ = 0,953<* - 0,9102(0.17609) 

This Y~value ror r pond^,* r o ^ r et «=»r>t: I on probabilltV of 0, 7861. 

To determine the probabillMes for employees with more than one 
year of service, the first step is to iterate the log-probability 
equation for the rri\polnt^ of e^ch- length x>f servi ce .categor y be- 
ginning with 1 - 7 yea's; I.e., mc e th^n one but less than or e- 
qua 1 to 2 years. (For convp'Menre sake we will stop at the cate- 
gory 9-10 yeafs. However , t f>p m^^^^oH use<^ would work for every 
length ofservl'-eratogorv.) 



Iteration give*; th*? followir>9 v;^lii*»S' 



L.O.S, 




Retent J on 


( Year..s> 


M i dpoi rvt. 


. Peij^cent 


1-2 


* i 

1.5 


• 78.61% 


2-3 


2.5 


72.28 




3.5 


67.6'6 


4-5 


4.5 


64.01 


5-6 


5,5 . 


61 . 01 


6-7 


6.5 


58b»4 5 .; 


7-8' 


7.5 


56.23 


a- 9 


8.5 


• 5.4.28 


9- 10 . 


9.5 . 


52.53 



In Addition, to . get .a r Vt-ent i on probability for^dur Ifijst group, we 
also need to iterate t^i^ i og- probabi 1 i ty equation^ for the category 
10-11 years 1 Thi s is: 7^ ' * . 



10-11 



10.5 
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The second, and final, ^tep Is to dl vl de each^year's retention 
percent Into the succeeding year s percent. For example, the 
?et^ntlon probability for empIoy.ees With l-2.years of service 
.Is? . 

72.28% = 0.9195. 
78.61% 

Similar calculations for each'length of service caregory give 
the following group rete^it I on probabilities^ 



L.O.S. 

(Years) 


Retention 
Probab'I 1 I ty 


0- 1 


0.7861 


1-2 


0.91 95 


2^3 


0.9361 


3-4 


0.9461 




0*9531 


5-6 


0.9580 


6-7 


0.9620 


7-8 


0.9653 


8-9 


0.9678 


9-10 


0.9699 



These probabilities can be used to estimate the number of em- 
ployees retalntxy ' -> -t^-h Ungth of service category. 
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Question* Using group retention probabi I I ties, how many employees. 
jf wM 1 be pifetarned from a given group one year later? 

Suppose that the I ength-of- serv i ce comppsltlon of a group of em- 
ployees In a given occupation at the end of a given (or base) y'ear 
Is: 

' ■* - 

. - " <^ 



L.o.s. 


Number of 


(years) 


Emp loyees 


0-1- 


TOO 


M-2 


81 


/2-3 


"/6 




71 


4-5 


6/ 


5-6 


6^* 


6-7 


(s\ 


7-8 


58 


6-9 ^ . 


55 


9-10 





This mea»^s ha^ ther^ are 10^^ emp^'^yees in their first year of 
servlc**, PI • f^^<^^r tr^r^>.»'i, v-tr, T*w*^ a total of 6^5 employ- 

Suppo^i^ t-i»s»r t 1 oo - pr n^ t 1 * t y equation for this group i^s^ 

Y - 0.9^^h - 0,910?Log(t) 

Using pr ev i ous 1 y-exp 1 e i rx'd Cerhnlques, the group retention proba- 
bilities as <;oc i ?rt *>a with this ) og-probabi Ij ty equation are- 



L.O.S. 

(years) 




Retent j on 
Probabi 1 i-ty 

. '.0.. 7861 ■ 

0.9195 

0.936L . 

G'.9'»6f 
♦0.9531 
. 0.9580 

0.9620 

0.9653 

0.9678 

0.9699 
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crj f, l"l retention P-n.bHUv 

the base year, (b) the retent-i on probaoility each <^3^^9°^y ° . 

Ind the estimates of employees retained are moved down to their n^ 
1 engtn-of -servl ce category. 



L.U.S. 

( years) , 

C- 1 
I ? 

7 

7 ft 
o 1 n 

I 0 M 



dase 
Year 

i oc ^ 

R] 

/■ 

7 i ^ 

s ■■■ ^ 



Retention 
Prob., 



( . /861 ). 

( . 9 3 o 1 ) , 
( ) . 

( . 

( . An 

( .^^70). 
( .9653 ). 
( .9678 ), 
. ( .9699). 



3ase 
Yr + 1 



73 
7^ 
71 
67 
■6U 
61 
S9 
56 



SO 



The estimated number of employees retained at the end of bas^ year .1 
!s .63i*- The retention rate for the ent. re group .s- 

63^/68S = ,92SS (or 92.99%) 

Conversely, the est i mated numner i oVt during base year-1 Is. 51 (685 - 
63^)^nd the lo*;s rate i*^-' 

51/685 - .0 /^5 (or 7.^5%) • , . 
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How do 

rate? 



changes In hiring levels affect a group' 
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Suppose that the log-probabi I I ty equation for a given occupation Is' 

Y = 0.$53A - 0.9l02Log(t) - 

Suppose further that over a period of years there has been a steady 
number of hires Into the occupation, say 100 hires per- year. Then 
the length of service distribution at -the end of the. base year, for 
those hired over the past five year^ would probably look something . 
I I ke this J 



Years SI nee 
HI re 

5 
k 
3 
2 
1 



Number 
Remal nl ng 

63 
66 
69 
7k 
81 



This gives 4rtotal of 353 employees. 

To, estimate the number of these employees who will still be on board' 
at* the end of next year (base year + 1), » <i can use .the group reten- 
tion prob<*bI n tl es assaoJated with the 1 og-probabi 1 1 ty equation. 
These are (a<; r^)ru)ated a previous question)* * 



Years Since 
_Hljre 

5 
k 
3 

™? ' r ■ . 

1 



Retentt on 
ProbabI 1 1 tv 

0,9580 

0.9531 

0.9461 
^ 0.9>61 
^ 0.9195 



After multiplying the number In each length of service category by the 
corresponding retention probability, we haveS • • , 



Years Since 


Number at End 


Number at End 


Number 


HI re A 


of 0ase Y«ar 


of Base Yr +1 


Lost 


5 


63 


\o 


3 


k 


66 


63 


3 


3 


6^ 


65 


k 




7 k 


69 


5 


1- 


8.1 


7k 


7 




3 S3 


331 


22 
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Thus, of the 353 starti ng ^employfes, 22 wMI be lost during the 
year . ' . f 

Now let^s hold tWs figure constant and see what effect different 
levels., of hiring will have on the occupation's turnover rate. 

Suppose that during the base year the same pattern of hiring con 
tinues l.e.f 100 employees are hired. Of these, 79 wl 1 1 be 
left by the end of base year 1 (using the log-probabi 1 I ty equa- 
tion).* This means that, 21 of the new hires will be lost by the 
end of base year 1 . Combining this with the losses from the 
other length of service categor I es we_fiave2 

21 + 22 = ^3 losses ^ 
. ' out of 

100 + 353 = ^53 employees 

This gives a tur^^^r rate of- 

^#1^' JlI = o.09i*9 (or 9.^9%) 

Now suppose that Instead of hIrl'Rg 100 employees durTng the, base^ 
year, management decides to hire 200, The nmnber remaining from 
this group at th^ end of base year + 1 would be 157.^ Thus, **3 em- 
ployees w<^vld be \o^t. Comb ining this figure with the .1 osses f rom 
the ^ I ^ , I . r f T o . V I o r At- pQor ? e<^*-^*^I ch r ema I n the sarne) we 

hp * 

♦ 72 ~ 65 lo^<;*>^ 
^u t of 

65 - 0 . n 7S (or 1 K 75%) 

,5S1 . • . 

Continuing In the same manner, w<> get the foHowIng turnover rates^ 
for dif^rent leveU of hirinq: 

Number Hired Turnover Rate 
. Base Year - Base Year ♦ 1 

0 6.23% 
50 8.19 ' 

100 9,49 

200 1U75 

300 13-17 
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Thust Increasing the number of new hires Increases the.turnover 
riate, Converselyt decreasing hires lowers the turnover rate. 
This. Is a logical result since most turnover occurs during the 
early years of service. , 

It Is also worthwhi le to note here, that a freeze on hiring will 
lower a group^'s turnover rate. This Is Important to realize , 
since decisions to freeze hiring are usually coupled with plans 
to let natural attrition take care of lowering employment levels. 
However, most calcul^Ions of the rate of this lowering of employ- 
ment are Dased on previous attrition rates and, thus, are usually 
wrong. The lowering of attrition rates In these cases must, be 
taken Into accoupt. . 

SInjIlarly, In a RIF situation, turnover rates are going to decrease 
both because no new hiring Is done and because the first employees 
to be RiFed are those with fewer ye^rs of service. 
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APPENDIX C-l 



TECHNICAL ANALYSIS 



( 



STAFFING NEEPS PLANN,ING COMPUTER PRCGRAM: 

1 OGPRO , ^ 
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LOGPRO TECHNICAL ANALYSIS^ 



IntroductJLon ^ . 

The coinputer tenninal program LOGRRO was developed to enable us er^ to: 

(1) perform a quantitative analysis of group turnover and 
retention rates (based on employee length of service); and 

(2) use the results of this ari-alysis to project these rates 
Into the future. 

Since the pajority of all vacancies are caused^by turnover (rather th^ 
by growth), the accuracy of projections of future hiring needs is 
directly related to the accuracy of turnover/retention analysis and 
projec-tion. C ^ 




LOGPRO and its two .subprograms kSSNDXP and ANDPX) together form a 
comprehensive, self-contained unit which completes all the necessary/ 
statistics required for the analysis an<t-projection of turnover/retention 
rates. This means that a user of this set of programs need not have 
an extensive statistical background in order to., successfully utilize 
and evaluate its results. - ' 

Data Collection and Input 

For use in LOGPRO, turnov«»r /retention .data are obtained By determining 
the numbers or percent of employees retained from a given group of 
hires after specified lengths of service. More precisely, data for 
the LOGPRO analysis method are collected by cohort group ; i.e., by 
following the retention behavior of the same group of employees over 
time. This type of data is^ also known as longitudj.nal data. 

Icfe^ly^ cohort groups should be composed of employees in the same 
occupation. However, if the number of employees in one or more 
occupations is small, then it is possible to combine like occupatioiis 
(e.g., all scientists). ^In addition, ther6 are techniques available/ 
,which will statistically determine which occupations may be grouped 
together. (See documentation for LPTEST.) 

■4 ■■ . . . < . . ■ . . j - 

^rom each cohort to be analyzed, two types ^f information must be 
recorded: ' " • 

(1) The number of emplpye^s in the original group of hires; and 

(2) The number (or percent) bf^ these employees retained after 
given lengths of time. 

The ''original group" is composed of employees 'hired during a specific 
span 6f time. This span should be limfted to on^ year or less. The 
Mgiven lengths of time" co(iy be: any standard time units (years , 
jiionths, days) after the^end of the original time spaa. Note that, since 

^ -191 . • ♦ ' 
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Sample Dat£L 




Year 


Z Lost 


1 


42 


2 


56 


3 


64 


4 


69 


5 


74 


6 


• - 77 


7 


79 


■ n 
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Sample Data 



Y/ar 


^Retained 


1 


58 


2 


44 


3 


36 


4 


31 :. 


5 


26 


6 


23 . 


7 


^21 



3 4 5' , 

Years of Service CcHnplet.ed 
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the» original group will contain employees with different lengths 

of s'ervlce '(from 1 day to 1 year) , there are several options InvolMid 

in assigning values to the time units. . / 

For exatitple, consider a group of employees hired during a given fiscal 
•year ending Sep tatther 30th. And '/suppose that retention data^ for thig 
group are, recorded on succeeding' September SOths.v The time values for 

these data points imay be:. , . 

* . ' * . 

(a) 1 year, 2 years, etc. (calculated from September 30th to 
September 30th); or 

(b) 1.5 years, 2,5 years, etc. (assuming an even distribution* 

'. of .hires over a year and using the jnidpoint; i«e«, 0.5); or 

• 

(c) l.X years, 2,X years, etc . ''^( where X is a factor 'Based Upon 
speci£it hiring pattern^, such as more hiring, done at the 
end' of a fiscal year); or 

(d) l.Y years, 2.Y years, etc. (where Y. represents the actual 
average length of service of the original group at the ^nd 

of the first fiscal year) . ^ » 

■ . <. 

V The Shape of the Curve 

The key charactOTistic of a turnover curve can be summed up in a^simple 
rule <©f thumb:/ \ . * ' 

Between ftwo^-thirdfe and three-fourths' of all of the turnover 
losses tlhat will ever take place from any giyen group •of 
h:tres will hav^^^^en place by the end of ^he third year 
a|ter.hiAing. And of that totjj^, roughly half will have / ^ 
taken pl^ce by the end of the first year alone. 

What' this >say& is that: - 

is) Turnover is a function of length of service; and 

(h) Jlost turnover qpcurs during the eiarly years of employment. 

This key characteristic of the turnover curve gives^it a partiqulkrly 
characteristic shape. In^Figure 1, "years of service completed" is. 
plotted along the horizontal axis and Tpercent lost^from, origin^^l 
group", is plptted along the vertical ams. It is 'Imiediately ^ 
observable ' that the curve bends ^sharply during the fi^^st three years 
'and then levels off. ■ % ' 

In frthe /same manner, a graph of a retentioti curve (since. 4C^tent ion is 
simply "number of employees in the or igijOi'al group-" minua^namployees 
lost from the group") shows the same character isticis. ^igici^ 2 , plots 
the retention curve associated with the turnover curve irf^Fi'gut^ Ii 
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this caae, the vertical axis represents "percent, of ch.e original * group * ^ 
retained*" Fpr purposes of- the following analysia discussion, the 
retention curve will be used, s . * ' - \ 



The siiwplest way t& analyze such a curve is to transform the rel^eVant* 
variables so that the data points form a straight ^line. Then analy^ft^^s 
Can be done by the^se of a least squares technique. In the case o5 
the retention curve, the transformations 'are 

(a) Percent of employees retained is transformed -intp number . of 
• standard deviations from the mean of a normal curve ; and 

' (b) Years of service is ^ transformed into the logar ithro of years . 
, ' of. service^ • . " ^ a 

These two transf onpations give this analysis method its^ name: the 
log^probabili ty technique. • ir ^ ■ " ' 

When the values in Figure Z^rt plotted on log-probability graph paper 



tshe result is a straight 'line (Figure 3). 



^ •■ / , \ • 

The basic as^sumptions behind the use ^^f the log-pr#jDability technique' 
for retention analysis ^ir*> thrppfol-H, "''vV ' ^ , - • 

C'a. • ^ * 

First,, the given groyp possesses a normal distribyion of-factors - 
motivaMng .group ^tetition and dyarTuro befffa^vior^ Th iis. a|^ximp t i ort is 
base,d ilpon th^ extreme complex! tJ of the interaction between ^n ^employee 
and his or her job situation. Any. one person can be motivated by any ^ 
number of "stay" and "go"- factors. For analysis purpQsab, the sigriifi-- 
carpt aspect of this multiplicity €f retention-producing factoi^ Jls^ not 
their "identity, but their distribution ^among' the worl^roup members. ^ ' 
New hires with extremely "a'nt\|.-thls-j6b" ^ttirt^des, for example, could 
be expected leave l^mmediatefly , regarjlless of what the jp articular, " \ 
reasons fot their individual attitudes Trnight be. Tlfeae^who -wer e " very,a 
:^Ipro-this- job" would stay, regardless' of' their reasonB.:; And the re8|: , 
would be found somewhere between. Under this schema, • then , the very '> 
• multlplicit;y (and the mutual independence^ ^of the employee retention/ 
loss factors which so impossibly complicates their identification shouW 
ensure that a frequency distribution of their ^appearance tn* the wor)cgr^p • 
should approach that of the normal ' cur\fe. " 1/ , ' ' • 

.Second, the .behavior resulting from these retention mbtj.vating factors ' 
will be displayed logar ithnyica^ly, rather than 'a^-ithifletically, ' over-' time .- 
Thii3 assumption is based Upon the distribution' of 'turnover . as expressed " 
- in eviously motioned rule of thumb. /The l^r^gestrbulk of turnover * 

Ij -Clark; ^H/L. , *'t^roblems Shd Progresa^ i,n Civil Sfervioe Manpowef* Planningko 
in. the United Staters." In: . /Manpower Plantilng ^lode ls (Clough, Lewis'^; 
6c Oliver Eds), London, English Universities Pr^ss^ 1974, pp.« 227 -^i^'. ,-^ 



l<y88©^H:>ccur^ in the first years, of setvi'ce with successively fewer 
losses occurring, as years g6 l?y. Thie^ is a logarithmic progression. 



lliird> the situational factors affecting t^he retention behavipr during 
the initial period of empirical observation will continue substantially 
unchanged throughout the remaining, period of the projection, . This 
asQinnption requites that there be np extreme change in the working 
situation of the cohort groups, Sjiich changes would-include large-scale 
reorganizations or prograia changes. ^ ' ^ 

Statistical Analysts 

The exact mathematical form o| the retention curve (also known as a 
"decay" cur>ye) is given by the linear equation: ; 

- " ^ Z(Y) = A + B LogX ^ ( 

\ where : . 

X T= length of service completed, ' 
■ ■■ ' .V . . • 

Y = percent of original group retained ^t 
point X. ^ 

Z(Y) « location on the normal cui;ve of 

the retention point Y, expressed in 
standard deviations distance froln the 
' mean. 

In order to obtain, the data which can be used to perform the linear 
regression which will determine the values of ^A .aaid- 6 in; equation ? 
(1), LOGPRO makes the aforementioned transfoihndtions of the inputted 
retentipn data (Page 5^. • 

Th^ first transformation uses the accumulative normal .distribution 
function to determine both (a) the standard deviation value associated / 
V/ith a given retention percentage (used in least squares calculation) 
"and (b) the retention percentage value associated with a given stand'ard 
deviation (used in curve iteration and goodness of fit calculations). 
The mathemati^^al formulas which acromplilh this basic two-way trans- 
formation aro Honp by T.OCPRO' two svibprograms ANDXP and ANDPX- 

ANDXP - This subprogram uses an approximation of the accumulative normal 
distribution function to estimate P (percent , retained) *from a giv^n value 
of X(standard Hipvif^Mon^ from the mean). . ^ t 

The form of this ap'proximalkron 2/; using IL^ -^^ ls^ < , 

P(Xj^) = 1 - Z(;X^)(bj^t + h^t - + bgt^ "+ b^t^ + t5t^) + e(Xi) . (2) 

2/ Abramowit'z, M. and StegUn I.A. , Eds, Handbook of Mathematical Functions 
~ AMS55j 9th, Natiojial' Bureau of Standards, 2>..2.17, p. 932. 
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t = 



.1. 



1. + ..2316419JC, 



fage 7 



= 0.3193815 



b2=-i). 3565638 



1.7814780 



b, =-1.8212560 
4 



= 1.3302744 



If the value b£ the inputted X is greater than or equal to zero, then 
the value P(X^) calculated from equation (2) is returned to th^ie main 
program. If X is less than zero, the the quantity (l - P(xL')) is ^ 



program. 3^ 
returned. 



The error funttion (e(X]^)) is accurate t5 7.5 X ^8 

ANDPX - This subprogram uses an approximation of the invei^se accumulative-! 
normal distribution function to estimate X from a given value of P. 
The form of this approximation 3^/ is: . 



X(P) = t -/ao + aj^t + a2t^ \ 
\l + bit ^ b^t'^ ^ >^3t^/ 



+ e 



(p) 



(3). 



where : 



In 



p7 




0.802853 
a2 = 0.010328 



3/ Ibid., 26.2.23, p,. 933; 
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= 1.432788 
b2 =0.189269 
b3 = 0,001308 
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Page 8 

X ■" ^ , / 

If the inputted value, of P is greater than or equlil to 0,5 (i.e, 507o) , 
then the actual value of P ir used in the calculation/ of t. If P is 
less than 0*5; then the quantity (1 - ,P) is used. . / ^ 

I '' ' . ■■ ■ 

The ^rror function (e(P)) for this approximation is accurate to -f - 
4.5 X 10*^. To increase its accuracy for u^e by LOGPRO, ANDPX contains 
an additional refinement which makes use of the additional accuracy 
available in ANDXP. 

To accotnplish this refinement, this Subprogram ANDPX: 

* ' ■ • *j ' 

. (1) Takes the value of X computed using equation (3) and adds to 
it + 0.0005 to determine two new values: X^^ and 'X2;- This 
> establishes one interpolation range. 

(2) Uses Aljj^XP to determine the percentage values associated with 
Xx and X2« This establishes a second interpolation range. 

Oy Usesythese two ranges to interpolate a new X value , X3 , to 

whiq^i it applies ANDXP and determines a percentage value, P3. 

i 

(4) Computes the differenc^e, D, between P and P3. This difference 
^ is th^ tested to determine if it falls below 5 X lO"'- " ^t 
does not, then it is recomputed (using new Xi and Pi values) 

until it does. . * ^ 

. " Ji 

(5} Returns the Xi value associated with 'the minimum difference to 
LOGPRO. * 

This series of steps i^ncrea^es the accuracy of Ajft)PX to sevell^ places. 
'\ Computations ^ 

After transforming the inputted /retention data into least squares form* 
LQGPRO perfoirms « series of <^<'»TTjpi^t ations : 

(1) The transformed ^atf» Ar^ used to rr^lrnlat^;^ log-probability^ 
r e t<^n 1 1 or> oquat Ion . 

(2) The value of the regre<?s1on f^qiiation is calculated for each 
retention point. This value is changed into both Estimated 
number anH percent r et- ned - 

(3) The squared deviation of each retention point from the lin^ * 
of regression is calculated and used to compute the standa^^ 

* deviation of the fit. 

(4) At operator option, the log-^proj^ilitiTregression equation ^ 
can.be projected into the future?^ Staifrfard extrapolation 
procedures are used. The operator specifies the desired 
projection parameters. 
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Page 9 



. (5) If the numtjer of retention points inputted is /greater tha^i 2, 
^ the 95% confidence range is calculated for each pi^ojected 
- point (using the standard deviation of the fit). The ranger 
' values are computed ^n both number and percent form. 

—Applications* to Other Areas 

Log-probability retention lines provide insight into a rtumber of 
management and personnel problems, - For example, they can be used to 
estimate-: ^ , • 

)ver expected in specific pla^i'ning periods (ei.g. , l77o in 
first yea/,. 97o in the second year, etc.).^ 



Turnov er 
the f ] 



Current Hiring Needed in order tq have specif ic*-numbers of 
workeirs onboard on a particular date in the future (e.g., 
160 hires. now to have lOO on board five years' hence) . 

Budget" necessary for current hires still present at, each . 
intermediate step tosyard- target (e^g., 133 at start, of sec<5nd. 
year, 120 at start of third year, etc.). 

Turnover.^ Imp 1 lemons of selection method elected by management 
(e.g., first-year quit rate^of 177o for a new hire, 67*. for 
selection of 3-years-service employee, etc.). " 

Training Cost implications of ei^ployee Be^ection methods such 
as "three-year return on training inves^ent" (comparative 
training investment per worker on board three years after 
training is appreciably; higher for a new Qutside hire than for 
an on-board ethployee reaR«igned or promoted for trainin'g). 

Personnel Cost implications of budgetary coiWtraints such as 
hiring freezes, employm^.nt cutbacks, etc. (Usually result 'in 
a decrease in the n-mber of lowMength-of -service employees in 
the worV-fnrr^. tb^'n <lr-Tpnn?Mp ^ r^'^v^*^ t ^rnovpt' rates). 



/ 
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STAFFING NEEDS PLANNING QOMPUTER PROGRAM;^ 

lOGPJiO ,v 



i 
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UNITED STATES CIVIL SERVICE CQMMISSICN 
\aSHINGTCN, D.C. 20^15 



\ 



-/ 



201 



- > 



ERIC 



'tsMifi^--. APPENDIX C-2 '■ ■ ^ V . ^ OPERATION MANUAL 



J- . . LOGPRO INSTR: P(1) QF (7). 

^"fioGPRO" IS A FORTRAN IV PROGRAM WHICH' PERFORMS LOG-PROBABILITY > 
.RETENTION ANAL YSJ S. IT Rf QUI RES" TWO SUBPROGRAMS: ANDXP AND 
I ANt^P'X^ \ . 

BsAfA requirements: , > . ' ^ ,, " 

THE .".LOGPRO^ PROGR/^MING SEQUENCE I S^'D^SIGNED, JO USE LONGITUBTNAL 

«Setention over tiHe-.data. such d'ata are. obtained .by observing 

TiH£ RETENIION BEHAVIOR OF ,A GROUP OF EMPLOYEES HIRED DURING A 
GtV^N PERIOD .OF JImE, SAY. ONE YEAR. THESE 'SAME EMPLOYEE? ARE 
THEN 'FOLLOWED OVER' TIME TO DEfERril'NE- HOW. MANY. (OR WHAT PERCENT) 
;GiF THEM REMAI?N IN THEIR JOBS AFTER 'GI VEN 'PERipDS OR TIMEjjE.G.. 
, BNE YEAR, two YEARS. ETC. MALYSIS OF THES^nPES OF DATA IS 
;. 4tS0 KNOWN AS "COHORT AhJALYSIS". . - • 

Slt'hoJgH. ANY "GrLuP .OF EMPLOYEES" MAY BE USED IN THIS PR^JGRAM, 
'^ COHORT ANALYSIS: OF THIS TYPE 'IS MORE ACCURATE IF THE EMPLOYEES 

IN THE GROUP(S). SELECTED ARE IN THE SAME OR SIMILAR OCCUPATIONS. 

FOR EXAMPLE, ONE GROUP COULD-BE ALL' CLERK-T YP I STS HIRED IN FY73. 

ANOTHER C0ULP BE ALL ,.SCI ENTI STS HI|ED IN FY73. . . 

THE PERIODS OF TIME US Et) COULD BE ANY STANDARD TIME UNITS (YEARS, 
MONTHS DAYS) OR ANY^FR'ACTION THEREOF. BECAUSE MOST COHORT, 
SrSIpSWILL CONSIST, 0 WITH VARYING UgTHS OF SERVICE 

(FROM l^AY. *rO V; .YEAR),. AN AVERAGING FACTOR ,CA^f BE ADDED TO 
THESE T,mE UNITS TO MORE CLOSELY -APPROXIMATE T^E ACTUAL AVERAGE. 
LENGTH OF SERVICE;OF "THE COHORT^ (E.G., FOR A GROUP 'OF EMPLOY;EES 
HIRED IN AN EVEN DISTRIBUTION OURI.NG A FISCALWYEAR, THE .fACTOR 
WOULD BE 0.5. .YEARS) . - ' 

■ ■ ' ' . '■ ." . i ' \ 

THE CURVE FORM USED IN "^LOGPRO" IS KNOWN AS THE "LOG-PROBABI Ll TY" 
JuRVE. THIS CURVE IS USED BECAUSE I T /MOST CLOSELY' APPROXIMATES 
ACTUAL RETENTION BEHAVIOR.. 

HYPOTHETICAL DATA SET: 

THE rOLLOWiNG IS A H^Y-PdTH^T I CA}. SET. OF DATA OF THE TYPE REQUIRED 
FOR THE "LOGPRO" .PRCGRAMMlNCv sequence: - ' 

SUPPOSE no cIIeRK^ TYPISTS WERE HIRED OURUIG FY1972 AN'D i 
SUPPOSE, THESE Vl IRES WgRf MADF EVENLY THRCfUGHOUT THE 
,FIS£AL YEAR AND SUrPrfSf • THAJ 

" AT THE END OF F.YJ THE NUMBEr' RETAINED 'WAS : 
1973 58 (OR ,52^73%) 

197it (OR ^0.00%) , 

. ,1 975 36 (OR 32'.73X) 

THE AVERAGING FACTOR FOR THIS DATA SET IS 0.5 YEARS; . • 

THIS SET OF DATA WILL BE REFERRED TO THROUGHbUT THIS MANUAL. 

» ■ ■ ■ • j. 

EXECUTION CQmANDSJ , , . ^ 



TO BEGIN EXECUTION Of THE •'LOGPRO" PROGRAMr.ING SEQUENCE . AN. 
OPERATOR WILL PERFORM A CHAIN OF EXECUTION COMMANDS.. THE ' 
AfiTUAL' FCRM OF THESE COMMANOS WlLl DEPEND ON THf T IME-:SHAR ING 



UJtER WRITTEJil- 



■1 k^' 



l^^'lifSTEH BEINq USED. IN — 
>fijE FOLLOWI NG OPER^T I ONS : 



.LOQPR.O INSTR:.:P(2) OF (7) 



THESE COMMANDS WILL PERFORM 



- CALL UP ■••LOGPRd''..AND ITS T^O SUBPROGRAMS TANKXP" , 
AND "ANDPX" AND TRANSLATE THEM INTO MACHINE LANG?'. 
UAGE. THIS IS. THE COMPILATION PHASE. - ^ 

- LCAD THE COMPILED PROGRAM- ANiD. SUBPROGRAMS, INT( 
THE CENTRAL PROCESSING AREA'^i/\ND START PROGRAM 
RUN. THIS IS THE EXECUtlON PhASEi 



DATA entry: » 

(NOTE THAT ALL OPERATOR-ENTERED RESPONSES TO COMPUtER-WR I TTEN COM- 
MANDS ARE FOLLOWED BY A CARRIAGE RETURN'.) 

T.^fE RUN OF THE "^LOGPRO" ANALYSI S' . SEQUENCE B.EGINS, IN THE FOLLOWING 

m'^vNner** • 1 . . 

* please enter the number of years for-which retention data" are^ 

* Available i 



THE Operator RESPONDS to this command by enter ikTg the number of 

YEARS (OR, IF DATAWARE IN OTHER THAN YEARS, THE NUMBER OFJIME 
PERIODS) FOR WHiCl^ COHORT-TYPE LONGltUDINAL DAIA ARE" AVAILABLE. 
IN THE CASE OF THE' HYPOTHET I CAL DATA SET,. THIS ANSWER WOULD BE 



■3" 

NEXT TW 



EOMPUTER asks; 



* Please ENTER THE number in the starting group 



■ ^ 




.JIS POINT, THE OPERATOR ENTERS THE NUMBER OF EMPLOYEES WHO . 
^OSED THE ORIGINAL GROUP OF HIRES FOR WHICH COHORT ANALYSIS 
IS BEING PERFORMED. USING THE HYPOTHETICAL DATA SET WHERE 110 
EMPLOYEES. WERE HIRED IN FY 1972 AND FOLLOWED DURING THE NEXT 3 
'YEARS,, THE RESPONSE TO THE ABOVE QUESTION WOULD BE ^"110". ., 

next: . . 

* wIll input data bf in . . . .• 

* ( 1 ) number op 

* (2) PERCENT 

* (ANS 1 OR ?) 

IF THE RETENTION DATA ARF IN THE FORM "NUMBER OF EMPLOYEES RE- 
TAINED FROM THE ORiniNAl. GROlir-, THEN THE RESPONSE TO THE ABOVE 
COMMAND 15^ "1". 

ON THE OTHER HAND, IF THE DATA ARE IN THE FORM "PERCENT o'f ' 



EMPLOYEES RETAINED FROM THE ORIGINAL GROUP 



THE RESPONSE 



^ IS "2". 

NEXT THE COMPUTER ASKS FOR THE RETENTION DATA IfJ THE FOLLOWING' 
MANNER,: _ 204 - " 
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\ :A:d)- LOGPRO I NSTR ! " P ( 3 ) OF ( 7 ) : / 



W^-M^d^^ VALUES OF X (TIME^rUND Y (NUftlBEft O.R PERCEj^T 

i^v|# ^i|;T)Vp#SeY^TlME J() m- THE FORM X.*Y, SHOWING ALL OECIMAL POINTS 

iit«fe!A-/ii«i>i^#J/$-:^^j.: ; ^ lETENTlON-OVER^'T-IME €ATA 

tE ENTERED AS (X.Y) PA^RS 
REl^RESENTS THE NUMBER 

t;:iiR: rC^ce'nt or the original croup- of employees remaining at" - 
•itfrtiE'^x. , . ■'■ ■ •, ■ ■ ■ ■ ■ ■ ■■■ 



1 . ' ■ ; 




• a • 



IF- the X-VALUES ARE IN YEARS, THEN r.HE ENTRIES FOR THESE VALUES 
ARE IK THB FORK l.X, 2.X, 3.X, ETC. (WHERE X REPRESENTS THE AV- 
ERAGING FACTOR). IF MONTHS" ARE USED, THEN THE ENTRIES CAN BE 
IN THE FORM (A) 3.+Y, 6.+Y, 5.+Y MONTHS (WHERE f = X TIMES 1.2) 
' OR (B); .2^+X, .5+X, .75+X YEARS. THE X-VACSUES 00 NOT HAVE TO 
'BE EQUALLY SPACED. A ' 

IF THE ANSWER TO. THE PREVIOUS QUESTION HAD BEElJ "l". THEN THE Y 
VALUES ARE IN THE FORM "NUMBER OF EMPLOYEESJ^ETATNED" . IN THIS 
CASE THE ACTUAL NUMBERS RETAINED ARE ENTERED INTO t>4E PROGR'AM ' 
-(SHOWJNG A|,i DECIMAL POINTS). USING THE HYPOTHETICAL DATA. SET, 
,THE X,Y ENtRIES WOULD BE: ' " • 



1.5,58. 
. 2.5,A4.' 
> 3.5,36. ' 



itr THE ANSWER TO THE PREVIOUS .qUeSTION HAD BEEN" "2", THEN THE Y * 
VALUES ARE IN THE FORM "PERCENT OF EMPLOYEES RETAINED". IN THIS 
CASE, THE J>ERCENTAGES ARE ENTERED IN THEIR DECIMAL FORM; EvG., 
52.73% IS, ENTERED AS .5273. USING THE HYPOTHETl CAL ' DATA SET,- THE 
X.Y ENTRIES WOULD BE: 

1 .5.,. 5273 • - - . 

- 2. ^,...4000 ^ 

3. 5,. 3273 ^ 

LOG-PROBABIL-I'AaNALYSIS: . ... 

THE COMPUTl?h THEN PERFORMS ALL THE NECESSARY LOG-PROBABILITY ANAL- • ^ 
YSIS FU^TIONS. THf^E INCLUDE: 9 

. - FITTING THE LOG-PROBABILITY CURVE; > 

C - COMPUTING THE LOG-P-ROBABI L I TY CURVE VALUES 

. (BOTH IN NUMBER AND PERCENT) FOR EACH TIME " . 

: • - PERIOD ENTERED; AND 

- COMPUTING THE STANDARD DEVIATION Cff THE LOG- 
PROBABILITY FIT (IF THE NUMBER OF OBSERVATIONS 
; IS GREATER THAN 2) 

THE COMPUTER THEN PRINTS THE "TABLE «F LOG-PROBABI L m ANALYSIS 

RESULTS" WHICH SUPPL I ES^T^IE OPERATOR WITH ALL OF THE RESULTS OF ' ^ 
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, LOGPRO !N$TR5 PC*) OF (7) 

* 3 COMPUTER WRITTElsi ' V , ^ ' 

ANAmiS. IN ADOITION^TO PRINTING GUT THE JCTUAtgAND 
•VALUE'S, ^THE COrtPUTER 'ALSO PRINTS OUT THE A AND P VALUE'S OF /THE . 
/L^OG^-pSoBABILUY E^JUA^ION^AN^D the value of THE STANDARD DEV^IATION 
OF THE FIT. . NOTE THAT THESE LATTER ,3 VALUES ARE IN THE FORn , 
.•iSTANDARD DEVIATIONS, l^OM THE MEAN OF THE fiORMAL CURVE".. (SEE 
^PAGE 7). ' , 




THE* FORMAT OF'THIS PRINnHiJ (USING THE HYPOTHET ICAL CTATA S^t) 
qp RESULTS is: 



: * 
* 

'* 
I* 

.* 
* 



xxxxx.xxxxxxxxxxxxxxxxxxxxxhxxxxxxx)fxxxxxxxxxxxxxxxx5<xxxxxxx.xx 

, TABLE OF LOG-PROBABILITY; ItN'ALYS IS .^RESULTS , , - - 
L . r).--- / ■ ; 

P CURVE-- 
PERCENT 





k . 


'../ 






--ACTUAL -DATA-- 


--L - 1 


YEAR 


NUMBER 


PERCENT 


NUMBER 


1 .50 


^-58! 


52.73 


57.88 


2.50 




4o:oo 


44.30 


3.50 , 


36. 


32.73 


3 5. '83 



52.62 
40,27 

3*. 50 . 36. 32. 73 35.83 /532.57 



* 
* 
* 



* . 

* TH 
* 



* THE 'NUMBER IN THE STARTING GROUP WAS: nO 

l£ LOG-PROBABILITY EQUATION IS ' / 

^ = " 0.31317 + ( -1 .40587) 

TUFy-STANDARD DEVIATION IF FIT IS 0,P^6205 , i ' 

*xxxyxxxyxxxxyxxxxxyxy>'"^^xxxxyxxxxxx)fxxxxxxyxxx)^xxxxxxxxxxxxxx 

\ 

lO.iFcri nN«; : . 

tJe next PORTION OF THE "LOGPAO" PROGRAMMING SEQUENCE DEALS WITH 
THE PROJECTION nr THF PREVIOUSLY CALCULATED LOG-PROBAB FL 1 T Y CURyE 
A PRHJECTIONIS A -iiionwiNr, rORWARO" T MF TRFNO OF PAST pATA 
iNin THF rinHPF. ^ ^^^-^^ 

TN "I OGPRO- THE "SE OF PROJECTION T ECHN I QUES .ALLOWS' AN OPERATOR 
TO FORM ESTIMATES Of THE NUMBER OF EMPLOYEES FROM THE ORTGINAL 
RRDUP WHO WILL BE RETAINED IN TH^IR JOBS AT GIVEN. POINTS IN THE 
fXe S^NCF SUCH PROJECTION FIGURES ARE "ESTIMATES" AND NOT 
EXACT V At UES. "LOGPRO" ALSO COMPUTES (FOR GROUPS WITH MORE THAN 
TWO TIME VAlUES ENTERED) THt 95% CONFIDENCE RANGE VALUES IN BOTH 
NUMBER AND PERCENT. THIS m'FANS THAT, FOR A GIVEN PROJECTION 

- 206 - 



20 i > " 



■ • ' • • LOGPRO INSTR! P(.5) OF (7) 



COMPUTEIR WRiTTEN 



■■■■ ■y; 



PmMT*. THiE , PROGRAM WILL CALCULATE A RANGE OF V/^LUES WITHI^^ , 
iWHliGH JH^ ACTOAL, NUMBER/ RE^^^^ AT A GIVEN POlNf fl.LL FALL 

:-^«-'OF-'tHE' tiME.' .■'/./"■■ ■ \ ''■ '''/('"' 

■ THE PROJfehONS S£QUEN.CE;^^M^ AS FOLLOWS: .; ' '] -r'^-" > 

IS A;|RbJEeTION ppiRE^? (YES=Y. NO=N) J: i. ; ' , ^ 

IF NOr StieH PROJECTION IS WANTE4), THE RESPONSE . TO THE QUESTION 
K »N"* THE COMPUTER WILL THEN PROCEED TG THE NEXT PORTION 
: Of jJHE PROGRAM.. 

* IF, A PR0JECTfi[5<< IS OESIRED, THE RESPON&|/f$ "Y'T THE COWPUTER 
THEN PRINTS OUT THE FOLLOWING INSTRU^T;KnS: , ^ 

* PLEASE ENtkR THE" MINIMUM ANO MAXIMUM VALUES 6f X DESIRED 

* PLUS THE DE§IREb J<-INCREMENT (E.G., -.5 TEAR, 1 YEAR, ETC.) 

* IN THE FORM* MI N, MAX, INCREMENT (E^G., 5.,15.,1.). 

* PL£AS| show ALL decimal POINTS. ^V-^ . 

AT THIs'pOINf THE OPERATOR ENTERS THREE VALUES: , 

1. THE FIl^ST (I.E., MINIMUM) TIME VALUE FOR WHICH A PRO^j^- . 
JECTION IS W;iNTEDj E.G., 5TH YEAI^; ^ 

2. THE LAST (I.E., MAXIMUM) TIME VALUE FOR WHICH A PRO- 
JECTION I » WANTED; E.G.,^OTH YEA«; AND" 

3. THE, LENGTH. OF X^'^e'^TIME INTERVAL BETWEEN SUCCESSIVE PRO- 
JEClriON PQLNTS (I.E., INCREMENT); E.G.,1 YEAR^ 

— ^' ■ . • , ■ ■ . ■■ ■ ■ ■ 

^tO ALLOW FOfif THE U$E OF FRACTIONAL TIME PER lOOS* ( SUCH AS THE USE 
^QF AN AVE Wffi I NG( FACTOR), EACH PIECE OF DATA ENTERED .HERE MUST 
INCLUDE •fr' DEC I MM. POINT. 

FOR\E-XAMPLE , THE VALUES LISTED ABOVE WOULD BE ENTfREQ AS/ 
S.,'\0.,\. 

THE COMPUTER. THEN PERFORMS .THE CALCULATIONS ^NECESSARY TO PRO- 
^JECT'THE LOG-PROBABILITY CURVE PREVIOUSLY CALCULATED* .. .^ 

IN ADDITION, IF THE NUMBER OF RETENTION DATA SETS ENTER£45 IS 
GREATER .THAN 2, .THE' PROGRAM CALCULATES THt 95% CONFIDENCE 
RANGE VALUES FOR EACH PROJECTED POINT. THESE ARE CALCULATED FOR 
BOTH NUMBER AND PERCENT VALUE^. THUS, -FOR EACH PROJECTION POINT, 
THE PROGRAM CALCULATES AN ;iEXPECTED VALUE" (OR "PROJECTED ESTI- 
MATE") AND A 95% RANGE OF VALUES (LOWEST AND HIGHEST). 

TH^ PRINTED OUTPUT FOR THIS PORTION-'OF THE PROGRAM IS (USING T-HE 
HYPOTHETICAL DATA SET AND THE PROJECTION LIMITS 5.5,10.5,1.): 
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LOGPRd IKSTR? :PC6) OF (7>. 



eOMpUTER WRITTEN. 



.:|- 



..^.-:.v^l^V------TA^^^^ OF PROJECTED VALUES- 



■ t ■ ■ ' ■ -4. 




: YtAB 


----NUMBER PROJECTED-- — 






EXPECTED 


■ 








VALUE 


- — 95% RANGE- — 




* 
* 


5.50 


V 26. 


2 5.: - 26. • 


0 ■ 




6.50 




• '22. - -^3. 






7.50 


' ■ 20. 


1 9. - t%. 




* _ 


a.si) 




.17. - 18. 




* 


9.50 




16C - 16. 




• * ■ 


10.50 




14. - 15. 




'* 


















• '« 











___ J-PERCENT .PR0JECTfidf-.-n7> 
EXPEXJTED 
•VALUE ■ 

' 23.3'* 
20,34 
17.96 
1.6.02 
r'«.4 3 . 
13.08 



■---95% 

' '2Z.'96 ii 'b>72' 
I9i99 - 120.69 
,17.63 - ;18.28 
15 i'73 - ■16.33 
14.15 - 14.71 
.12;:82 - 13.35 



2, THEN THE 95% RANGE CANNOT BE CALCULATED AND ONLY I^XPECTED 
VALUES ARE CALCULATED AND PRINTED OUT- ' ' 

- ■■ ' a'- ■ 

IN THIS CASE, THE OUT^|IT WOULD B& (USING ONLY THE. FIRST TWO 
ELEMENTS DF TH^ HYPOTHETICAL DATA SET ANOr THE SAMPLE PROJEC; ^ 
TION LIMITS ABOVE)^^ 



.9& . 



■---NUMBER PROJECTED—- --PERCENT PROJECTED 



* ---TA3LE OF PROJECTED VALUES- 

* . _ 

* YEAR 
*. 

; J 

* 
* 
* 
* 
* 
* 



5.50 
6.50 
7.50 
8.50 
9.50 
10.50 



25. 
22. 
19. 
17. 
15. 
14- 



22.66 
19.62 
1 7^22 
15.28 
13.69 
12.35 



RECYCLING J 

NEXT THE COMPUTER A^^KS^ ' - . ; 

* RUN AGAIN WITH A DIFTFRENT DATA SET? (Y OR N) . 

IF THE OPERATOR WISHED TO RUN ANOTHER SET OF DATA THROUGH THE 
"LOGPRO" PROGRAMMING SEQUFNCF THEtJ THE ANSWER T-0 THIS QUES- 
TION IS "Y- (I.E., YE^). ^ - 

IF. ALL DATA SETS HAVE ALREADY BEEW ANALYZED THEN JHE ANSWER IS 
"N^ (I.Ei, NO). - 208 - ^ 
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LOGPRO INSTR« P(7) ,0F (7) 



SUBPHabRAMSS - '^^^^^^ ' ' ^ 

THE •'•^bfiPRid*' PROGRAMMING SEQUEN9E USES TWO SUBPROGRAMS WHICH 
CONSTITUTE the; BASIC STATISTICAL.IECHNIQUE NEEDED FOR LGG- 
PRqeABTLITy ANALYSIS, f 

THE FIR^T, KNOWN AS "AND'PjE", TRANSFORMS "PERCENT RETAILED"" 
I NJd /number of standard DEVIATIONS FROM THE MEAN OF THE' 
NORnALCURVE". THE SECOND, "ANDXP", IS tHE INVERSE OF 
■•^^^ANDP-X". (SEE "LOGPRO* TECHNICAL ANALYSIS FOR A DETAILED EX 
;l?LANATiON OF THESE TRANSFORMATIONS.) ; _^ 

■ :■ ■ ■ ■ 




) . 



/ 
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LOGPRO 



PAGE 1 



C 
C 
C 

C 
C 
G 
C 
C 
C 
C 
C 



-LOG-PROBABILITY ANALYSIS OF GROUP RETENTION TREND- 




32 
3V 
40 

42- 

44 

. 47 

51 
70 

s 

100 



f 



dALCOLATES THE LOG-PROBABILITY EQUATION FROM LONGITUDINAL 
RETENTION OA.T A WHICH IS EITHER IlA THE FORM (A) "NUMBER OF 
EM'PL6Y££S RETAINED" OR (B) "PERCENTAGE (OF ORIGINAL GROUP) 
OF EMPLOYEES. RETAINED". ALSO CALCULATES THE STANDARD DEVIA- 
TION OF Fit: _ . • 
IN. ADDITION, IF DESIRfb, PROJECTS AND/OR mTERPOLATES fHE 
'log-probability CUR^VE and CALCULATES 95 X CONFIDENCE RANGE 
VALUES' FOR EACH PROJECTION POINT. ^" 
REQUIRES FUNCTIONS. ANDPX AND ANDXP. ,4 

' 'iCAL INV - • " '"-^ ^ ' ' 

D?MEN?I On'^ X (It ) , Y ( ro ) , XL ( 1 0 )», YS ( I 0) ,A 1 0 ) , A Y ( 1 0 ) , AA ( 1 0 )^ NX ( 36 ) 
1' ,NDH(36) ' 

+,IV(10,50),IYR(10) ^ , , . • 

DATA (NX(I),I=1 ,36)/36*-XX'/,(NDH(I),I=1,36)/36* — / 

INV=.FALSE. ' ■ • 

WRITE (5,27) - ^ " , ' 

F0RMAT(//1X, 'PLEASE ENTER THE NUMBER OF YEARS FOR 
1 'WHICH RETENTION DATA ARE AVAILABLE',/) 
READ (5,40)N 

IF(N.GT.2)INV=.TRUE. ' <■ ' 

WR ITE(532) 

FORMAT (//IX, 'PLEASE ENTER THE NUMBER IN THE STARTING GROUP',/) 
READ (5,34)NO . • « 

FORMAT (f6) 

AN = NO . , " , 

FORMAT (12) 
WR I TE ( 5 42 ) 

FORMAT (//,1X,'WILL INPUT DATA BE IN' , / , 3X ,' ( 1 ) NUMBER OR',/, 
1 3X,'(2) PERCENT', /.3X,'(ANS 1 OR 2)',/) 
READ (5,44)NP - 
FORMAT (12) 
WR ITE(547) 

FORMAT (//IX, 'PLEASE ENTER- THE VALUES OF X (TIME) AND Y (NUMBER' 

1 ' OR PERCENT')' ^ 

WRITE(551) 

FORMAT( IX, 'RETAINED BY TIME X) IN THE FORM X,Y SJtoWING ALL DECT 
1- 'MAL POINTS',/) 
DO 100 1=1, N 
READ (5,*)X(I),Y(I) 
FORMAT (2F1 0.4) 
IF(NP.EQ.l) Y(I)=Y(I)/AN 
XL(I) = AL0G10(X(I)) 
YS(I) = Yd) 
iSiV) = ANDPX(YS(I)) 
CONTINUE 
SUMX = 0.0 
SUMY ='0.0 
SUMXY =0.0 
SUMX2 = 0.0 
SUMY2 =0.0 
DO 2^0 I ,= l,N 
SUMX = SUMX + XL(I) 

( ' - 213 - ' 
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SdMY = SUMY + YS(I) ■ ^ * 

SUMXY = SUMXY + XL(I)*YS(I ^ 
SUMX2 = SUMX2 + XL(I)*XL(I 
SUMY2 = SUMY2 + YS(I)*YS(I) 
220 CONTINUE . ' , , . 

, EN = N ^ ■ 

51 = EN*SUMX2-SUMX*SUMX 

52 = EN*SUMXY- SUMX*SUMY 
• SL = \2/S\ 

YIN = 5tJMY/EN-(SL*SUMX)/EN 

380 '• 00 i^SO I = 1 ,N ^ . 

A(I) = YtN + SL*XL(I) 
SUMD=SUM0+(A(I)-YS(I))**2 

A(I) = ANDXP (A(I)) V 
450 CONTINUE •' \ 

ASO=0.0 ... 

IF(.NOT.INV) GO TO 522 - 

ASD=SQRT(SUMO/(EN-].)) 
522 00 527 I = l.N ' 

AY(I) = AN*Y( I ) 

AA(I) = AN*A(I) 

■ 5i?^^ CONTINUE ^ ' . / 

. yfs . WRITE(5,529)(N3l((I),I = l,36) A , 

;29 F0RMAT(///7/, IX,36A2) 

foRiAi(//!l7X.-TABLE OF LOG-PRDBAB I L I T Y ANALYSIS RESULtS-) 
CX WRrTE( 5,823)(NDH(I),I = l,2]) ^ 

823 FORMAT( 16X,21 A2) ' y . 

534 FoJJAT(/!m.---ACTUAL DATA--'^3X.---L - P CURVE---) , 

53^ FoiKAin7x!-YHAR-.3X.-NUMBER PERCENT*- .3X .'NUMBER PERCE|s,X- 

; 1 ,//) • 

00 543 1=1, N > . — 

y( I) = Y(I)*10(5. 

FORMAT{l7X,F%.2,3X,F6.0,2X,F7.2,3X,F6.2,2X,F/.2; 

Sk3 CONTINUE . x 

WRITE(5,932)(NOH(I),I=l,2]) 

932 F0RMAT(/,16X,2]A2) • " 

F0RMa1(?!i5J';-THE number in the STARTING GROUP WAS:'. 16) 

?0rfST(/!l5X.'THE LOG-PROBABILITY EQUATION IS') 

WRITE(5,894)YIN,SL \-rQc-W-^ " 

F0RflAT(l7X,'Y = -,F9.5,'' + ( ,F9.5, ) X ) 

FoiJAi(/!?5J?'?HE STANDARD DEVIATION OF FIT IS-.F9.6) ' ' 
WRITE(5,9«7)(NX(I),J=>,36) . 
907 vJ-ORMAt(//lX,36A2> , . , 

^5^0 FOrJat'(//)iJ.-IS a projection desired? (YtSpY. Nb=N)-./) 

READ ( 5,560)L ♦ ; _ # 

V 



542 



934 
892 
894 
896 



• LOGPRO N ^^^^ ^ 

560 FORMAT (A21 

IF (L.EQ.'N') GO TO 690 a . , 

WRITE (5,590) ^ 

590 ~ FORMAT t/lX, 'PLEASE ENTER THE MINIMUM AND>1AXIMUM VALUES OF 
1 ' X DESI-RED') ^ ^ 

WRITE(5,n82) \ ^ 

1182 F0RMAT(1X,'PLUS THE DESIRED X-INCREMENT (E.Gl, .5 YEAR, 1 
1 'YEAR, ETC.)') . ^ - • ^ 

WRITE (5 1185) ^ 
1185 1 FORMAT(i'X,'IN THE FORM: MIN. MAX, INCREMENT (E.G., 5. , 1 5.., 1 . ) .') 

, WRITE(5,1188) ' - • 

1188 FORMAT( 1«, 'PLEASE SHOW ALL DECIMAL POINTS; ,/) 

READ (5,*) XMIN',XMAX,XlNC ' . . 

-6r0 ^FORMAT (3F8.'») . ^ 

CwRITE(5,H96)(NDH(I),I = 1r^6) -1 . v 

1196 FQRMAT(///1X,8A2, 'TABLE OF PROJECTED VALUES ,8A2„ - ,/) 
WRITE(5,'l201 ) 

1201 FORMATdX,' YEAR -—NUMBER PROJECTED PERCENT 

1 'PROJECTED ') \ 

IF(INV)WRITE(5,1206) \ 

1«06 F0RMAT(7X, 'EXPECTED'. 17X. 'EXPECTED') ' - \ 

IF(INV)WRITt(5,,1208) ^ - . * 

1208 F0RMAT(8X, 'VALUE', 3X. '---95% RANGE---' , 2X , VALUE ^'♦X, 
\ ^ -- — 95% RANGE './) 

QR = XMIN , 
' ASD=ASD*2. . 
630 » ANS = YIN + SL*AL0G1 0 ( QR ) 

ANSI = ANDXP(ANS) ^ ^ 

R1=ANS+ASD 
■'R2SANS-ASD 
ANN=ANS1*AN 
. R1=ANDXB(R1) 

R2=ANDXP(R2)v . 

AN1=R1*AN .' . 

AN2=R2*AN 

R1=R 1*100. ' \^ 

R2=R2*100. ^- ^ 

ANS1=ANS1*100. 

IF(.NOT.INV)WRITE { 5 , 66D ) QR , ANN. ANS 1 
660 FORMAT ( 1 X ,F5 . 2 , 9XvF6. 0 , 20X.F7. 2 ) 

Ir(INV)WRITE(5,665)QR,ANN,AN2,ANl ,ANS1 ,R2,R1 
665 FORMATdX, F5. 2, 2X,F6.0,2X,F6.0,' .F5 . 0, '♦X, F6 . 2 , 3X ,F7 .2 . 

1 ' -'^6.2) 

QR, = QR + XINC 

IF. (QR.LE.XMAX) GO TO 630 

WRITE(5,1282)(NDH(I),I=1 ,28) 
1282 F0^AT(/1X,28A2,'-') 

WRITE(5,907)(NX(I),I=1 ,36) 
690 WRITE (5,700) ^ ' 

700 F0RMAT(///1X,'RUN AGAIN WITH A DIFFERENT DATA SET? <Y OR N) ,/) 

READ (5,720)K 

720 FORMAT (A2) - . 

\IF (K.EQ.'Y') GO TO 25 \ 
STOP * .■ , 

END , , ' ' 

- 215 - 
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ANOPX ' PAGE k 

FUNCTION ANDPX(P) 
C ACCUMULATIVE NORMAL DISTRIBUTION FUNCTION. 

C APt>ROXIMATES X (+ OR - .00000'*) FROM 

DATA A0,A1,A2,B1,B2»B3/2. 515517, .802853, .010328, 1.432788, V 

1 .189269, .001308/ , 
IF(P.GE.l.) Xl = '». ' V 

/ IF(P.GE.l.) GO TO 380 • 

( IF(P.GT. 0.0005) GO TO 70 . « .. 

X 1 = — . ■ ■ ' ' 
• ^ GO TO* 380 ' ' , 

70 IF (P.LT.0.5) go TD 110 ^ ' 

•E = SQRT(AL0G (!./(». -P)**2)) • 

XI = E-((A2*E+Al)*E+A0)/( ((B3*E+B2)*E+81)*E^<1 . ) 
GO TOfl 40 r . . ■ 

no P = 1/-P • 

E = ^RT( ALOG( 1 ./( 1 .-P)**2 )J ^ • , 

Xl = rl.*(E-{(A2*E+Al )*E.+A0>/(((B3*E+B2)*E + B1 )*E+1 .)) 
P = 1 .-P 
140 AXl = XI +0.0005 

AX2 = Xl -0.0005 
API = 'ANDXP(AX1 ) 
AP2 = ANDXP(AX2) 
• AI = (P^-AP2)/(AP1-AP2) 
^ - 8X1 = AXl+AI*(AXl-AX2) ■ 
y Dl - P-ANDXP(BX1 ) ' / 

IF (D» .LT. 0.000001 ) GD TO 290 / 
AX3 = BX1+1.1*D1 
AX4 = BXl-1 .1*D1 

AP3 = ANDXP(AX3) / 
AP4 = ANDXP(AX4) ' 
AJI - (P-AP4)/(AP3-AP4) 
BXl = AX4+AII*(AX3-AX4) 
Dl = P-ANDXP(BXl ) 
290 IF (D» .LE. 0.0000005) GOTO 370 

IF (Dl .GT. 0.0000005) GO TO 340 
BXl = BX1+,0. 0000002 
01 = ANDXP(BX1 ) 
GO TO 290 . 

340 BXl = BXl-0.000aft02 - • 

, Dl *=^ANDXP(BXiy^ 

GO TO 290 • 
370 ANDPX = BXl j 

GO TO 390 ' 31 ^ Si 

380 / ANDPX=X1 
390 ( RETURN 

END 
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ANOXP ■ , PAGE,;5 

I, . 

FUNtTION ANDXP(X)' 

ACCUMULATIVE NORMAL DI-STR I BUT I ON FUNCTION. V\PPROX IMATES P 
FROM X (TO + .Ofi - .0000(301 ). (NBS-55, P. 932) 
XI = ABS(X-)' , 
. . T = 1 ./(]'.'+. 2316i»19*Xl) 

ANDX>- = 1 .■-.3989^:23*EXP(-.(Xl**2 )/2. )*( .319381 5*T-. 3565638*T**2 
1 +1 .78U78*T**3-1 .8J>J256*I**'» + ] . 33027»»*T**5) 
IF (X.LT.O.O^ANDXP = l.-ANDXP 
RETURN < 
- END • ■* 
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LPFILE TECHNICAL ANALYSIS 
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Introduction 



^^^^ 1 
1/ <^ 



■ ■ - ■ ■ : . '^r 

The computer •terminal program L?FILE was developed to enable users to 
"^erfijrm a quantitative analysis of turnover /retention rates for" > ^^>r* 
ungrouped/ noi^-lodfeitudinal data. ' j 

LPFII^ and its- thiee • subprograms (ANDXP, ANDPX and DATE) together f orm- -* 
a comprehensive, 9jelf -contained unit which completes all the necessary 
fetati^tic$'requii?ed for Jthe analysis/of turnover /retention rates. This ^ 
means tliat a user o£ this set of programs need not have an extenfeive 
statistical. background in prdar to successfully utilize its results. 

LPFILE calculates a log-probability equation from previously stored 
data^files. The statistical /cransformatioijs and basic assumptions of^, 
tlie longitudinal log-probability analys±s -method also apply to LPFILE^ 
(See LOGPRO Technical Analysis, pages 3-8). 

Data Collection and Input 

LPFILE is designed to perform log-probability analysis when longitu- 
dinal data are not available. It does so by using computer files which 
contain certdin basic data items oh each employee in. a given group. 
'By utilizing data on every employee, -LPFILE thus utilizes every available 
contributor to the group retention trend. 

The concepts ot "original group'V a^nd "pej^cent retained'' a,s ysed by 
LCX3PR0 (see LOGPRO Technical Analysis, page 1) are carried over to 
LPFILE in a modified fashion. The "original group" is a subset of those 
hired in the past few (e/g., 5) years. The "percent retained" figures 
are obtained fr:bm a cumulative distribution which compares the number ^ 
of employees in the group who could have served for at least X years 
with those who actually hWe served at least X years* Such a distri>^ 
butioii could conceivably b\ carried down to number of- days of service 
(e.g., every member of the group would have at least one day of service 
while only those hired five years ago have five years of iservic^) . 

Calculation of this cumulative distribution is possible with the :* 
knowledge of two dates: ^ 

(1) The date of hire for each member of the wiployee 
group under study; and 

(2) The date of separation for every member who left the 
^ group* 

■> - 



-^Users of LPFILE should also refer to documentation for LOGPRO 
^ / . , - 223 - 
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To perform its calculati^s, LPFILE requires-a data file consisting of 
f our ^data elements: ' i ^ 

• . . \ . 

(1) Occupation or series codej^ \ 

" ' ■ . ' . "■ . 

• (2) Grade at hire; ' ' . - ' ' " * ' • 

\' * ' . • ■ ^ 

(3) \Date of hire (month and y^ar); aiid 

* » ■ *" — , 

(4) . Date'of separation (month and. year) , if any. 

When :these, dat^ items are beitig collect^^d, it^will^be ^ecessa'ty to 
record another piece of information: ^ '^h- employee identif icaticm, 
.numbe]f. This item is needed to facilitate the britiging togethdfr- 
df the other items of data, ^ ' 

The conversion of such a d^ta file into a computer file ia discussed 



in the" LPFILE Operation Manual (pages 1-2)- c . . * 

LPFILE also requires the input of a ''file ending date" This is the 
closing date of the data file and of . the time period under study. 

Statistical Analysis 

From the inputted data file and the file ending date, LPFILE sets up 
two one-dimensional arrays each of which cdntains one value for each 
employee in the f±ld. The first array (Tl) consists of values of* the 
actual timd on board for each employee. The second (T2) consists of 
the\^elap.sed time between the employee date of hire and the file ending 
date. For those employees who hav^^j^ separated,' these two values 
would be equal. 

These values are then used to determine the X, Y pairs needed for the 
log -probability regression analysis. - ♦ 

Quantitatively, thte process is: 

Le t : ; 

Ni = Employee i 

DOHi = Date of hf^ for employee i \ 
DOSi = Date of separation for employee i — 
. DOF File ending date * 



11 

*~ If an employee has not separated, both the month and year of the 
date of separation have the value zero. . 

V . 224 - ' 
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■ ' * . 

Then: ^ ' i ^ r . . 

\'- ' p» 

.T2 (Ni) = DO? - DOHi > 
^If employe^ Ni has separated from the original group, theri 

j:i (Ni) = DOSi --DOHi/' . ^ ^ ' 

If not, then: ' , 

/ ^ ■■ ^ ^ ' ^ 

Tl (Ni) = T2 ^Ni) ^ , ^ 

'■ ' ' , 

(The3e ''leSg^of time" calculations ' are performed by the subroutine^ 
DATE which .does a straightforward subtraction of two inputted dates,) 

Each value in Tl and T2 is^ then . converted into its base 10 ibgaritbn. 

The next steps form a cycle which calculated the X, Yjpair^s. A cycle 
begins with the finding of a maximum 'value^rom the^Z v^tor (TMAX) .^n 
Each cycle produces a new .TMAX. and this value gets sm^lleV each tlm^ 
through the cycle (as old values of TMAX are marked as such). 

' .. ' 

Let-:, . * ' tl ' ^' ~ 

NP = the number of 'employees who could hS|fe pos^bly survived 
to at least time TMAX. ^ p3 

NR = the numjier of employees ^ho did survt-^^k.^to at :|feast tijiie 
TMAX (i.e. ]^ere, retained) , ■.. 

* ■ (i ' 

Both these values serve as counte^^s and both are> initialised af^ zero 
at the start of a cycle. ^ I |N ' 

For every value in the T2 vector wll^c^t is greater ^than or equal to 
' TMAX, one is added to the NlP counter. If , in these cases, the 
associated value in Tl is also greater than or equal to TMAX, one 
is added' to the NR ^counter. 

At the eiid of eifeh cycle, ^a "percent retained" value, PR, is ^ 
calculated using the Summed "values NP and NR: 

PR = M 
NP 

Then an X, Y pair is recorded for tlife^'lo^-probabili^ty r^ifression 
analysis with (using ANDPX - See LOGPRO Technical Analysis.,^ page 7) 

- X = TMAX ■ . ^ , " ' ; 

Y = ANDPX (PR) 1/ ' . ^ r ' 



— ^ ANDPX acts as a. function whose valu^ is given by' the computation 
of the approximation contained in the^ subprogram. 

- 225.1; 
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A new cycle is then, begun. ^ This process continues until all possible 
values of TMAX h^ve been used. When this f>oint is reached, ^11 the 
recorded X, Y jiairs are used to calculate a log-probability equation 
and a standard deviation. ^These results are output to the^jaser. - . 

Data Groupings' 

^ • c ^ 

The Qperator is^iven the opjtion to run LPFILE using one or .both of ' 
the first two data items in the file: occupation code and grade 
hire. It can also be run for the ejit ire input file. 



This option allows the user to construe^, data files w^ch contain 
records on employees in different occupations. (All employees in, 
one occupation should be placed \in the same data .file.) However-, in 
these cases, separate runs should be made for each occupation. 

The ability i:o calculate log-probability equations by grade at hire 
i»*useful ^for determining whether differences in entrance levels lead 
'to differences in retention patterns. ^ 

"The "entire file" option can be used when t;here is one occupation 
.^er> data f ile./ . " * , 
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tPFIL^ IN,STR.-''P( 1 ) (5)- 



■LPFRE" IS A FORTRAN IV PROGRAM WHICH CALCUUTE^A LQj&-P'ftOQA- 
BILITY EQUATION FOR UNGROUPE'D, NON-LONGITUOINAL: R^ETENTiON DATA. 
IT REQUJRES THREE SUBPRQX5RAMS : " ANDXP" . . "ANDPX" . AND. "DAr"" 

DATA, requirements: - V T ■ ' U' > ' ' ! 



DATA-#E "ENTERED INTO THE "LPFlLEr pRollfeAM FROM PREV-IOUSLY- 
STbftEO DATA F.ILES. THESE FILES CON-TAIN ONE RtCORB PER EMPLQY- 
EB AND CONSIST OF FOy'R SPECIFIC DATA ELEMEN-TS PER RECORD: 

" ' i ' ■ ■ ' • " . ■ ■ ' 1 . '■ ■ 

. '( 1 ). OCOJPATION CdDE; , - 

(2) GRADE AT flIRE; . • " 

(3) DATE OF HlftE (M^NTH AND YEAR); AND 

(li) DATE OF SEPARATION^ (MONTH AND YE^R), IF ANY. 

...M v-JUCECTION Pl^ASE.F.OR THE DATA FILES.-REQUIRES AN ADDtTIONAt- 
ITPM OF data: AN EMPLOYEE IDENTIFICATION NUMBER (E.G.. SOCIAL 
SECURITY ACCOUNT NUMBER). THIS ITEM IS NECESSARY S4NCE THE 
OTHER, ITEMS ARE MOST LIKELY FOUND ON SEPARATE PIECES OF PAPER. 

A SINGLE DATA FRE fAY CONSIST OF (A) ALL EMPLOYEES £)R (B) 
ALL EMPLOYEES IN A GIVEN OCCUPATION OR (C) ALL EMPLOYEES IN 
A FEW GIVEN OCCUPATIDNS. 

A SAMPLE COLLECTION FQ^ MIGHT BE: 

EMPLOYEE OCCUPATION ^ GRADE DATE OF DATE OF 

NUMBER CODE AT HIRE HIRE SEPARATION 

.\„_--- ----- 

- 1 201 5 10/70 '»/7i» 

2 ■ 212 7 3/72 NONE 

212^ 5 v1/71 2/72 



\ 201 7 5/7l» NONE 

5. 201 7 3/73 10/73 

ETC. ' . • ^ 

A FORMxSUCH A-^THIS MAY BE CONVERTED TO A COMPUTER FILE IN TWO 
WAYS. THE CHOICE OF METHOD WILL DEPEND ON WHAT THE USER S 
'time-sharing SYSTEM WILL ACCEPT. IN i&NY CASE. IF AN EMPLOYEE 

has not separai.ted, both the month and year of separation are 
given the value zero. . ^ 

In addition, the data items are always placed in the file in , 
the following order: occupation code, grade at hire, month - 

OF HIRE, YEAR OF HIRE, MONTH OF SEPARATION. YEAR OF SEPARA- 
TION: . ' 

FORMATTED DATA FILE -- IF' THE DATA f^ILE IS* IN THIS FORM, THEN 
ALL DATA ITEMS FOLLOW EACH OTHER WITH NO SEPARATION CHARACTER 
BETWEEN THEM.. HOWEVER, ALL SPACES CALLED FOR IN THE FORTRAN 
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LPFILE INSTR: P(2) OF (5.) 

f ■ * 

"FORMAT" STATEMENT MUST 3E ACCOUNTED KOR IN EACVl LINE OF THE \ 
FILE (ONE LINE OF FILE = ONE EMPLOYEE). 

AT PRE'SENT, THE INPUT FORMAT STATEMENT FOR DATA FILES IN "LPFILE" 
is: 

., . '■ F0RM'AT(I5,5I2) 

THIS FORMAT RESERVES FIVE SPACES FOR OCCUPATION CODE AND TWO , 
SPACES EACH FOR GRADE AT HIRE, MONTH ANO YEAR OF HIRE, AND 
MONTH AND YEAR OF SEPARATION, ALL VALUES ARE IN INTEGER FORM 
WHICH MEANS THERE ARE NO DECIMAL POINTS. 

IN THIS CASE, THE SAMPLE FORM ABOVE WOULD BE CONVERTED TO A 
COMPUTER FILE IN THE FOLLOWING WAY:- ^ 

00201 05 10700'»7't # 

002120703720000 

002120501 710272 

OO'20107057'tOOOO « 
002010703731073 

ETC. ' . ■ ' 

NOTE - IN MOST SYSTEMS, LEADING ZEROES MAY BE REPLACED BY 
BLANKS ^WlTHOUT AFFECTING THE OPERATION OF THE PRO- 
^ GRAM. - 

UNFORMATTED DATA FILE IF THE DATA F I LE ! I S I N ' THI S FORM, THEN 
THE DATA ITEMS, IN THE FILEARE SEPARATED By COMMAS. IN THIS 
CASE, THE SAMPtE COLLECTION FORM WDULO BE tONsVERTED TO A COMPUT- 
ER FILE IN THF FOILOWING WAY: 

201 ,5, 10, 70, I*, 7'* IN 
212.7,3,72,0.0 
212.5,1.71,2,7? 
201 ,7. 5. 7'*, 0,0 

201.7,^.7-'.io.7^ 

OEPENniNG UPON THE COMPUTER SYSTEM USED. SUCH A FILE MAY fiE READ 
IN BY A NUMPERFD FORMAT STAIEMFNT (S'ICH AS THE ONE ABOVE). OR IT 
MAY BE READ IN iKING AN " - IM r» Arr nr A FORMAT STATEMENT NUM- 
V BER. 

EXECUTION cnr'MANOS:. ( ' ♦ ' , 

TO BEGIN EXECUTION- OF THE "LPFILE" PROGRAMMING SEQUE^^CE,. AN 
OPERATOR WILL 'PERFORM A CHAIN OF EXECUTION COMMANDS. THE 
ACTUAL FORM OF THESE COMMANDS WILL DEPEND ON THE TIME-SHARING 
SYSTEM BEING USED. IN GENERAL, THESE -COMMANDS^ W I LL PERFORM 
THE FOLLOWING OPERATIONS: / \ 

- CALL UP ''LPFILE" AND ITS THREE SUBPROGRAMS "ANDVP", . 
"ANDP/" AND "DATE" AND TRANSLATE THEM INTO MACHINE 
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LANGUAGE. THIS IS THE COMPILATION PHASE. 
LOAD THE COMPILED PROGRAM AND SUBPROGRAMS INTO 
THE CENTRAL PROCESSING J^REA AMD START PROGRAM 
; RUN. THIS IS THE EXECUTIONvPHASE'.' 

OATA ENTRY; • . : 

(NOTE THAT ALL OPER^ATOR-ENTERED R&SPONSES TO COMPUTER-WRITTEN COM 
MARDS ARE FOLLOWED .BY A CARRIAGE RETURN.) 

THE "LPFILE" PROGRAMMING SEQUENCE BEGINS WITH THE "FOLLOWING COM- 
( MANd: . . ' 

♦ENTER FILE ENDING DATE (MONTH, YEAR) ' 

' • ' . . . ■ 

AT THIS POINT; THE OPERATOR .ENTERS THE MONTH AND YEAR OF THE 
FILE ENDING DATE. 'THIS IS THE CLOS ING, DATE OF THE DATA FILE 
AND .OF THE TIME PERIOD UNDER STUBY. THE DATE iS ENTERED IN 
THE FORM MONTH, YEAR (E.G., 6;75). ^ 

NEXT, THE COMPUTER ASKS": , - 

♦ENTER THE -NUMBER OF EMPLOYEES IN THE FILE 

THE Of^ERATOR ENTERS THE NUMBER OF EMPLOYEES FpR WHICH TflERE ARE 
RECORDS IN THE\FIl,E- 

next: 

♦ENTER THE NAME OF YOi/r TURNOVER DATA FILE . . 

♦(MUST BE .LESS THAN OR. EQUAL TO 5 CHARACTERS) 

AS EACH DATA FILE IS ENTERED INTO COMPUTER STORAGE, IT IS GIVEN- 
A NAME TO IDENTIFY IT. IT IS THIS NAME THAT IS ENTERED HERE. 

AT^PRESENT, THE FORMAT FOR THE INPUT OF THE FILE NAME IS "AS". 
IT WILL ACCEPT- A NAME . UP TO FIVE CHARACTERS IN LENGTH. HOW- 
EVER, SOME COMPUTER SYSTEMS MIGHT REQUIRE A DIFFERENT FORMAT- 

IN ANY CASE, THE FILE NAME MUST BEGIN WITH AN ALPHABETIC CHAR- 
' ACTEft. '• • 

NEXT, THE COMPUTER ASKS: ^ 

♦DO YOU WISH BREAKDOWN BY: 

♦ (1) OCCUPATION 

♦ (2) GRADE 

♦ (3) BOTHvOR 

♦ C*) NONE. RU^J WHOLE FILE 

♦ (ANS 1 , 2, 3 OR i*) 

"LPFILE" CAN BE USED TO CALCULATE LOG-PROBABILITY EQUATIONS 
BASED ON OeeUPATI ON, GRADE AT HIRE GR BOTH. IT CAN ALSO BE ' 



- 231 



220 



* = COMPUTER WRI TTEN 



'lPFILE INST,R: P(A) OF (5) 



USED TO CALCULATE ONE LOG-PROBABILITY EQUATION FOR THE ENTIRE 
INPUT FILE. THIS OPTION WILL BE USED MOST OFTEN WHEN FILES AF 
SEPARATEO BY OCCUPATION. HOWEVER, IF" ONE FILE CONTAINS MORE 
THAN ONE OCCUPATION, SEPARAtE RUNS SHOULD BE MADE. ( 

IF OPTION "l". IS CHOSEN, THE COMPUTER -ASKS : 

♦ENTER DESIRED OCCUPATION COOE 

THE OPERATOR ENTERS THE NUMERICAL CODE OF THE OCCUPATION DE- 
SIRED (E.G., 201). THE COMPUTER WILL THEN SEARCH THE^NPUT 
FILE ANO PULL OUT ONLY THOSE RECORDS- OFaEMPLOYEES WITH THAT 
.OCCUPATION. ; JT 

IF OPTION "2" IS CHOSEN, THE COMPjJTER ASKS: 

■y 

*ENTER pESIRED GRAOE 

THE OPERATOR ENTERS THE NUMERICAL VALUE OF THE GRADE DESIRED 
(E.G., 7). UNOER THIS OPTION, ONLY THE RECORDS OF EMPLOYEES 
WITH THAT PARTICULAR GRADE AT HIRE ARE PULLED OUT. 

IF 0PTI0N^"3" IS CHOSEN, THE COMPUTER TYPES. OUT BOTH OF THE 
COMMANDS ABOVE ANO THE USER ENTERS THE APPROPRIATE NUMERICAL 
VALUES. ' IN THIS CASE, THE COMPUTER SEARCH IS FOR RECORDS 
WITH THE DESIRED OCCUPATION AND GRADE COMBINATION. 

UNDER OPTION "'t", NO FURTHER QUESTIONS ARE ASKED AHD THE COM- 
PUTER USES THE WHqLF niF FOR ANALYSTS PURPOSES. ^ 

ANALYSIS OUfPUT.S: 

THE COMPUTER THEN CALCULATES A LOG-PROBABILITY EQUATION BASED 
ON THE INPUT PARAME'ERS, A "^TANDARO DEVIATION IS ALSO CALCU- 
LATED. THF PF<:i)l I A"F IMF" ni'Tpiii m THE USER IN A l^ABELED 
f 0PM. 

UNDEf OPTinN "1", IMF I /^nn r - in ! pr rnnh? 

*FOR ncruPATiON! yy^ 

UNDER OPTION -7'', IT I ■ 
*FOp GRADE: VX 

'option "3" COMBINES THE FIRSI TWO LABELS 

*FCm occupation: XXX K 
*F0R grade: XX 



UNDER OPTION "A", THE LABEL IS: 
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• f. - LPFILE. INSTR: R(5) OF (5).-- 
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THEN THE RESULTS ARE PRfNTED OUT IN THE FOLLOWING FORMAT* 

* ' ■ . ■ 

♦THE L-.P EQUATION ISs 
*Y = 0.,123'»5 ♦ (-1.23^56)X ' 

•*■■ ■■■ 

♦AND STANDARD DEVIATION IS: 0.123'»5 



NEXT. ^ ... J 

♦WRITE/OUT ACTUAL AND CURVE VALUES? (Y OR N) 

AT THIS POINT, THE OPERATOR HAS THE OPTION TO WRITE OUT THE X,Y ^ 
PAIRS WHICH WERE USED TO CALCULATE THE LOG-PROBABILITY EQUATION. 

• (, » 

IF "Y" (OR YES) IS INPUT, THE ACTUAL AND CURVE VALUES ARE PRINT- 
ED OUT UNDER THE E;0LLPWINC HE^NGS: 

♦•T HE.' RETENTION VALUES ARE^ 

* L.O.S. -PERCENT RETAINED- 

♦ (YEARS) -ACTUAL- -CURVE- 

PECYCL-ING: ,• ' 

NEXT, THE COMPUTER ASKS: 

♦AGAIN WITH THE SAME FILE? (Y OR N) 

IF THERE ARE MORE OPERATIONS TO BE RUN FOR THE SAME INPUT FILE 
(SUCH AS'FINDING A LOG-PROBAB I L I T Y EQUATION FOR A DfFERENT OC- 
CUPATION)^' THEN THE ANSWER TO THIS QUESTION IS "Y". IF "Y" IS 
ENTERED, THE COMPUTER RECYCLES TO THE POINT WHERE A TYPE OF BREAK- 
DOWN IS REQUESTED (SEE PAGEM ) . 

IF T^^ERE ARE NO m6rE OPERATIONS TO BE RUN ON THIS SAME FILE, THE 
ANSWER IS "N". IF "N" IS FNIERFO, THF COMPUTER TYPES: 

♦AGAIN WITH ANOTHER rilF? (Y OR N) 

f 

IF THERE IS ANOTHER rjLF FOR WHICH ANALYSIS IS DESIRED, THE ANS- 
WER TO THIS QUESTION IS "V". THF COMPUTER THEN RECYCLES TO THE 
BEGINNING OF THE PROGRAM, 

c 

IF, THERE ARE NO MORE FILES TO BE ANAL Y2E0 THEN THE ANSWER IS 
"N". THE COMPUTER THEN TFRMINATFS THE RUN OF "LPFILE". 
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LPFILE. 



THIS PROGRAM PERFORMS LOG-PROBABILITY ANALYSIS'ON 
iJNGROUPEO, NON -LONG I-TUp I NAL RETENTION DATA. 

DATA ARE ENTERED INTO THE PROGRAM FROM pf^tVIOUSLY- 
STOREO EMPLOYEE. FILES CONSISTING OF FCUJR DATA ELE- 
MENTS PER EMPLOI^E (OCCUPATION CODE, GRADE. AT HIRE, 
DATE OF HIRE, DATf OF SEPARATION, IF ANY). ONE 
FILE RECORD EQUALS~ONE EMPLOYEE. 4 

'LOG-PROBABILITY EQUATIONS MAY BE CALCULATED BY OCCU 
PATION AND/OR GRADE AT HIRE. 

REQUIRED SUBPROGRAMS: ANDPX, 'ANDXP, DATE 
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LPriLE PAGE 1 

DIMENSION I T(6),T1( 500), T2(500),XX(500),YY(500),YA(500) 
20 ; WRITE(5,30) 
: ; 30 FORMAK//////, IX, 'ENTER FILE ENDING DATE (MONTH, YEAR) ) - ' 

READ(5,*)MGF,MYF 
50 FGRMAT(2I2) 

60> . WRITECS.yO) ' ■ 

70 FORMAT(/, IX, 'ENTER THE NUMBER OF EMPLOYEES IN THE FILE ) 

READ(5,90)N ' ' . 

90 FORMAKI**) 

WRiTE(5',no) - • ' 

no FORMAT(/, IX, 'ENTER THE NAME OF YOUR TURNOVER DATA FILE' 

1 ,/,lX,"'(MUST BE LESS THAN OR EQUAL TO 5 CHARACTERS.);^) 
' READ(5,1'»0)NAME 
140 F0RMAt(A5) 
150 WRITE(5, 160) 

160 F0RMAT(/,1X,'D0 YOU WISH BREAKDOWN BY:' ,/ , 8X ( 1 ) OCC 

1 'UPATI0N',/,8xr('2) GRADE', /,8X,'(3) BOTH OR',/, 

2 8X,'(4) NONE. RUN WHOLE FILE',/,8X,'(ANS 1, 2, 3' 

3 'OR 4)') 
READ(5,210)IBR 

.210 F0RMAT(I2) 

IF (IBR.EQ.4)G0 TO 330 . 
IF(.IBR.EQ.2)G0 TO 290 \ 
WRI TECS, 250) 

' 250 FpRMAT(/, 1>C,'ENTER DESIRED OCCUPATION CODE ) 

READ(5,270)NOC 
270 FORMATCIS) 

IF (IBR.EQ. 1)G0 TO 330 
290 WRITE(S,300) 

300 FORMAT(/, IX, 'ENTER DESIRED GRADE') 

READ(5,320)NGR 
320 F0RMAT(12) 
' 330 CAL'L IFILE(20,NAME), 

KI=0 

DO 510 1 = 1, N , ' I, 

350 READ(20,355)(IT(J) , J=l .6) 

■I 355 FORMAT(I5,5I2) 

IF(IBR.EQ.4) GO TO 410 

IF( IBR.EQ. 2)G0 TO 400 

IF(IT(1).NE.N0C)G0 TO 510 

IFdBR.EQ. 1 ) GO TO 1 0 
400 IF(IT(2).NE.NGR) GO TO 510 

(410 M0H=IT(3) 
MYH=IT(4) 
M0S=IT(5)' 

.MYS=IT(6) • • 

IF(MOS.EQ.O)GO TO 480 
CALL DATE(MOS,MYS,MOH,MYH.DA) 
IF(DA.EQ.O)GO TO 510 . 
480 CALL DATE(MOr,MYF,MOH,MYH,DT) j 

IF(DT.EQ.O)GO TO 510 
KI-KI+J 

o T2(KI)=DT . . ' 

IF(M0S.EQ.0)T1 (KI )=DT 
IF(M0S.NE.0)T1 (KI)=DA 
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^10 CONTINUE 

.TM>VX=AL0G10(T2(1 )) 

TMIN=TMAX . - 

00 580 1 = 1 ,.KI ^ 

Tl(r)=AL0G10(Tl(U) . 
. T2(I)±AL0G10(T2(I')) 
IF(T2(n.LT.TMlN)TMIN=T2(I ) 
580 IF(T2(I ).GT.TMAX)TMAX=T2(I ) 

SUMN=0.0 S. fc; 

SUMX=0.0 • ^' 

SUMY=0.0 ' - 

SUMXY=0.0 . 
SUMX2=0.0 

■ : J=o '■ . ■ ■ .. ■ ••■ ■ . r':: 

M=0 

.660 ENPOS=0.0 . 

ENRET=0.0 
DO 740 K=1 ,KI 

IF(T2(K).LT.TMAX)G0 TO 740 
ENPdS=ENPOS+l . 
. IF(T1 (K).GE.TMA'XJENRET=ENRET+1 . 

IF(T2(K).NE.9. )J^1 ^ 
T2(K)=9. \ 
740 CONTIVjUE >■ 

REt=ENRET/ENPOS 

IF((RET.EQ.1.).OR.(RET.EQ.O.O))GO TO 890 ' - 

Y=ANDPX(RET) 
X=TMAX 

M=M+1 • - . 

XX(M)=10.**X • 
YY(M)=RET 

' . ^ NO=ENPOS . • 

C DO 880 MM=1 ,N0 

SUMN=SUMN+1 . • ' 

SUMX=SUMX-^X 

SUMY=SUMY+Y 

SUMXY=SUMXY+X*Y 
880 SUMX2=SUMX2+X**2 
890 IF(J.EQ.KI )G0 TO 940 

TMAX=TMIN , ■■ 

DO 920 L=l ,KI . ^ 

920 IF((T2(L).NE.9.).AND.(T2(L).GT.TMAX))TMAX=T2(L> 

MF(J.LT.KI) go to 660 ' / 

940 EN=SUMN 

Sl=EN*SUMX2-iSUMX*SUMX 
S2=EN*SUMXY-S,UMX*SUMY 
B— S2/S 1 
- A=SUMY/EN-(B*SUMX)/EN 
SUMD2=0.0 

DO 1020 LL=1,M A 
YA(LL)=ANDXP(A+B*ALOG10(XX(LL) )) 
1020 SUMD2=SUMD2+( YY(LL)-YA(LL))**2 ' 
EM=M 

■^SD=SQRT(SUMD2/(EM-V. ) ) '\ , 

WRITE (5, 1060) \ 
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LPFILE 

■1060 FORMAK ////////) 

IF( IBR.EQ. 1.0R.IBR.EQ.3)WRIT£(5,1080)NOC 

1080 FORMAK IX, 'FOR OCCUPATION: ',15) 

IF(( IBR.EQ-.2).0R. (IBR.EQ. 3) )WRITE(5,n00)NGR 

ndO FORMAK IX, HFOR GRAOE: ',12) ♦ 
IF( rBR.EQ.l»')WRITE(5,n20) 

1120 FORMAK IX, ^FOR THE ENTIRE^ FILES') ^ 
WRITE(5,121»0)A,B 

1240 FORMAK/, IX, 'THE L-P EQUATION IS: ',/,IX,'Y = '.F9.5,' + ( 
1 ,F9.5,')X') 
WRITE(5,1270)SO 

1270 FORMAT (/, IX, 'AND THE 'STANOARO OEVIATION I.S: ',F9.5) . • 

WRITE(5,1127) ''^ . 

1127 FORMAT(/, lX,'WftITE OUT ACTUAL ANO CIJRVE VALUES? (Y, OR N) ) 
REA0(5, 127l»)KWR > . 
. . IF(KWR.EQ.'N')GO TO 1271 # 
WRITE(5, nl»0) 

llifO F0RMAT(/,1X'THE RETENTION VALUES are;') ' . 

WRITE('5, 1160) ■ ^ 
1160 FORMATC/, 6X,'L.0.S.',3X, '-PERCENT RETAINEO-') 

WRITE(5, 1180) 

1180 F0RMAT(6X, '(YEARS)', 2:X, '-ACTUAL-', 3X,'-CUF(VE- ,/) 
MN=M ^ % 
00 1 220 MA=1 ,M 

WRITEK<1210)XX(MN) ,YY(MN) ,YA(MN) 
1210 FORMAT(6X,F7.2,.2X^F8.3,3X,F7.3) 
MN=MN-1 

1220 CONTINUE » ' 

1271 WRITE(5,1272) ^ 

1272 FORMAT (///, IX, 'AGAIN WITH SAME FILE? (Y OR N) ) 
'* REA0(5, 1274)NAG . 

12.7'» FORMAT (A2) ^ ^ 

IF(NAG.EQ.'Y*^REWIND 20 

IF(NAG.EQ.'Y')GO TO 150 . ; ^ 

WRITE(5^,1277) ' • 

1277 FORM AT (7, IX ;' AGAIN "WITH ANOTHER FILE? (Y OR N) ) 
REAO(5,1274)KAG 
. IF(KAG.EQ."'Y')GO TO 70 
WRITE(S. 10<^n> 
STOP 
END 
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V ANOPX PAGE 

- FUNCTIdN ANDPX(P) 

C ACCUMULATIVE NORMAL DISTRIBUTION FUNCTION, 

C . APPROXIMATES X (+ OR - ..OOOOQi*) FROM P'. ' oo 

OATA.AO.Al ,A2,B1 ,B2,B3/2.51551 7,.802853 ,.010328f,1 .i»32788, 

1 . 189269, .001308/ . ., ■ 

IFCP.GE.I.) Xlri*. 

IF(P.GE. 1. ) GO TO 380 • 
IF(P.GT.0.0005) GO TO 70 
XI =-4 , . 

V - GO TO*380 

70 IF (P..LT.0.5) GO TO 110 

E- = SQRT(ALOG (1./(1.-P)**2)) * 
XI = E.-((A2*E+A1)*E+A0)/(((B3^E+B2)*E+B1)*E + 1.) 
GO TO UO . 

]W , P = 1.-P :^ 

- E = SQRT(AL0G(1./(1.-P)**2)) H^. 

XI ='^-1.*(E-((A2*E+lM)*E+AO)/(((B3*E+B2)*e?BlJ*E + 1.)) 

P'= 1 .-P . 
140 : AXt = XI +0.0005 ' 

AX2 = Xl -0.0005 ■ ■■ " 

API = AN0XP(AX1 ) 

AP2 = AN0XP(AX2) 

AI = (P-AP2)/(AP1-AP2) . 

BX1 = AX2+AI*(AX1-AX2) 

01 = P-AN0XP(.BX1 ) 

IF (01 .LT. 0.000001 ) GO TO 290 

AX3 = BX1+1.1*01 ■ % 

^ AXi* = BX1-1.1*01 

^ AP3 = AW&XP(AX3) 

/ AP'* = ANOXPCAX'*) 

All = (P-AP'»)/(AP3-AP'») ' 
* BX1 = AX'»+AII*(AX3-AX'») 

01 = P-AN0XP('BX1 ) 
290 IF (01 .LE.0.00O0O05) GO TO 370 

IF (01 .GTiO. 0000005) GO TO 3'»0 / 

BX1 = BX1+0. 0000002 

01 = ANOXP(BXl) 

GO TO 290 ^ 
3kO BX1 = BX1-0. 0000007 

01 = AN0XP(BX1 ) 
-GO TO 290 
3 70. .. ANOPX = BX1 

GO TO 390 
380 AN0PX=X1 

390 RETURN / 

ENO - ^ 
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FUNCTION ANDXP(X) 

ACCUMULATIVE NORMAL DISTRIBUTION FUNCTION. APPROXIMATES^^ 
FkOM X (TO + OR - .0000001). (NBS-55, P. 932) 
XI - ABS(X) 

T = 1./(1.+.2316419*X1) 
ANDXP =1 .-.3989i»23*EXP(-(Xl**2)/2.)*(.31938l5*T-.3565638*T**2 
- 1 +1.781^78*T**3-1 .821256*T**'» + 1 .33027'»^T**5) . 

if (x.lt.o.o) andxp = 1. -andxp 
'return . 

END ' • 



/ 
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SUBROUTINE OATE(rtOI .MYI .MOF.MYF . DAT ) 

NY=MYI-MYr 

NM=MOI-«OF . 

EN=NM 

EN=EN/12'. • 
ENY=:NY' 
DAT=ENY+EN 
RETURN^ 

Elilb ' 



ERIC 



243 \ 



230 



/ 



/i^PENDIX. E 



STAFFING NEEDS PLANNING COMPUTER PROGRAM:" , 

, LPTEST . " ' /, 



ERIC 



245 



APPENDIX E-1 



TECHNICAL ANA 



STAFFING tJEEDS: PLANNING COMPUTEf^* PROGRAM 

LPTEST 
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UNITED STATES C tV I L S ER V I CE COMMISSICN 
■ WASHINGTON. .D.C. 20415 *. 
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• LPTEST TECHNICAL ANALYSISi/^ 



/ * 'Introduction 

. f - ■ * . — '. ' — ; — 

The'" computer tjirtrmlnal prograi^^ LPTEST was devei?oped to' enable users to: 

(1) doinpare sets of retention data to determine whether or/fiot^ 
'they may be grouped together; and 

(2) Use the results of these comparisons as input into the log- 
probability analysis program (LOGPRO) • 

LPTEST and its three subprograms (ANDXP, ANDPX, and FTEST) together form 
a comprehensive, self-contained un,it which coti?ple^« all the necessary 
statistics requirfed to compare retention groups. This means that a user 
of ^this set of programs need not have an extensive statistical background 
sssfully utilize and evalualte its results. . .': . 



in order to succe 



\ 

The form of the input data, the statistical transformations, and theV^.^^.^ 
basic methodological assumptions are the same as tl\ose for LOGPRO • \See 
LOGPRO Technical Analysis, pages 1-8.) -^I^^^ 

^ ^ Data Groupings %i 

'■ ■ ■ ' . ' . I . - : ' , ■• 

The basic groupipg of data for LPTEST iwill be by individual occupation 
aiid. the basic comparison test will be to determine what occupations can 
;J)fe grouped together for log-probability analysis purposes. However,, it . 
is also possible to use groupings which are the sum of two or more occu- 
.^patip^s* In this cas6, it will generally have previously been determined 
that these occupations- are ''compatible/* Summed groupings of occupations 
may .be^ tested against either other summed groupings or individual occupa- 
tioffs. 

- < • 

In addition, groupings can be based on other factors which might be of 
interest to the user. Some of these factors are sex, minority status, 
veterans preference, grade at hire, etc. Since retention rates mayyvary 
. greatly among occupations, comparisons based on these factors shouxd be 
made within the same occupations* 



IJ « Sitice. the reiSults of thi^ program will be used to determine the 
Input to LOGPRO, users of LPTEST should also have th^ documentation for 
LOGPRO. Thus, this Technical Analysis refers to the Techrfical Analysis . 
for LOGPRO rather^ than re-describing common, aspects of the two processes. 
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, Statistical Analysis 

■■■ ^ . ' ir 

LPTEST allows for the initial input of retention data for up to twenty 
occupations or groups,' The operator may select any or all of these 
groups ^or retention trfnd comparisons^ These comparisons are made . 
using an analysis of variance technique, * . - . 

Calculation of Variance - After selecting out the (NG) groups specified 
•by the user (each group liaving NP retention points), LPTEST calciaates 
two variances which are used tp determine a value of the F-statistic, 
These are: \^ ^ ' ; ' 

(1) The variance among the tested groups; and * 

(2) The variance within the' tested groups. 

■ • . , ' ■ • «t . 

The F-statistic is then calculated using the formula': 

F " Variance #mong groups 
Variance within groups 

Where: 1 

Vj^ = degrees of freedom associated with the variance 
among the groups ^ 

\ = NP-1 ^ — ^ ^. 

V2 = degrees of freedom associated with the variance 

within the groups * 

^ ' =? (NP X NG) - 1 ^ .. . . 

Test of Sign^icance - The probability of qhance o^urrence of the cal- 
culated F-statistic is directly determined by the , subroutine FTEST, 
This subprogram uses the following approximatioir-' for X {the number of 
standard deviations from the mean of a normal curi^) : 

.V3 /l - 2 



^ = ^ ■ 9V 

2. 



21 Abramowitz, M, and Stegun, I. A., Eds. / Handbook of Mathematical * 
^ . Functions , AMS55, 9th, National Bureau of Standards, 26.6.15, p, 947!\ 



r 
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The subprogram then uses ANDXP to calculate the accumulative normal 
distribution function value (Q) associated with X, Then, the jjroba- 
bility of chance occurrence (P) is: . 

P « 1 - Q , « . 

P-values of 0.05 or less are considered to be statistically significant. 
The value of P determines whetherv or not the tested, groups may be com- 
bined. 

Lbg-^Probability Analysis - A log-probability equation is calculated for 
each of the tested groups. In addition, if the groups are found to be 
compatible, their retention data. are. summed, and a lo^-probability equa- 
tion is calculated for the total group. The statistical techniques used 
in these calculations are explained in the LOGPRO Technical Analysis 
(pages 6-8) . 

Analysis Outputs 

I 

The form of the final output of LPTEST is labeled, with the codes of the 
occupations or groups tested. The output consists of:. 

(1) A statement of grouping which tells whether or not the tested 
groups may be combined; 

(2) A table of data showing, for each group tested: 

^ 'if 

^ (a) Occupation code, 

(b) Number in each starting group, and 

(c) A- and B-values of each subgroup's log-probability 
equation; and 

(3) When the tested groups are compatible, the A- and B-values of 
the combined group's log-probability equation. 

There are options provided which allow an operator to (a) make further • 
tests using groups whose data have already been entered or (b) enter and 
test a different set of groups. 
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LPTEST INSTR.* P(1) Of (9) 



"LPT-EST" IS A FORTRAN IV PROGRAM WHICH COMPARES THE RETENTION 
TRENDS OF TWO OR MORE OCCUPATIONS TO DETERMINE IF THEY CAN PE 
GROUPED. TOGETHER FOR INPUT INTO "LOGPRO" (THE LOG-PROBABILITY 
ANALYSIS PROGRAM)., "LPTEST" REQUIRES THREE SUBPROGRAMS: "ANOPX", 
■ANOXP",- ANOVFTEST". '^^f 

DATA requirements: t 

THE "LPTEST" PROGRAMMING SEQUENCE IS DESIGNED TO ANALYZE AND 
COMPARE TWO OR. MORE SETS OF LONGI TUOINAL "RETEWTIDN-DVER-TIME" 
DATA. THE RULES OF DATA COLLECTION FOR "LPTEST" ARE THE SAME 
AS, THOSE FOR "LOGPRO". (SEE "LOGPRO" TECHNICAL ANALYSIS OR 
INSTRUCTION MANUAL.) 

GENERALLY. THE DATA SETS INVOLVED WILL REPRESENT DIFFERENT 00- 
CUPATI'DNS. HOWEVER, IT IS ALSO POSSIBLE TO COMPARE GROUPS WHICH 
AftE EITHER SUBGROUPS- OF ONE OCCUPAf>lON OR ALREAOY-GROUPEO OCCU- 
PATIONS. (SEE GROUPING OCCUPATIONS, PAGE 8). 

HYPOTHETICAL DATA SETS: ° ' ^ ^ 

THE FOLLOWING ARE HYPOTHETICAL DATA SETS OF TWE TYPE REQUIRED 
FOR THE "LPTEST" PROGRAMMING SEQUENCE: 

^SUPPOSE THAT DURING FY 1972 AN ORGANIZATION HIRED 110 
CLERK-TYPISTS (GS-322), 1 25^ SECRETARIES (GS-318), AND 270 
PERSONNEL MANAGEMENT SPECIALISTS (GS-201). SUPPOSE THAT , 
THES£ HIRES WERE MADE EVENLY THROUGHOUT THE FISCAL YEAR. 
AND SUPPOSE THAT THE FOLLOWING RETENTION VALUES WERE RE- 
CORDED OVER THE NEXT THREE FISCAL YEARS: ^ 

FOR 322: 

AT THE ENDiOF FY: THE NUMBER RETAINED WAS: 

1973 58 (OR 52.73Z) 

1974 44 (OR 40.00%) 

1975 36 (OR 32.73%) 



FOR 
AT 



318: 
THE END 

1973 

1974 

1975 



OF FY: THE NUMBER RETAINED WAS: 
77 (OR 61.60%) 
62 (OR 49.60%) jfc 
53 (OR 42.40%) ^ 



FOR 
AT 



201 : 
THE END 

1973 

1974 

1975 



OF FY: THE NUMBER RETAINED WAS) 
225 (OR 83.33%) 
201 (OR 74.44%) 
190 (OR 70.37%) 



THE AVERAGING FACTOR FDR THESE DATA SETS IS 0.5 YEARS. 

THESE SETS OF DATA WILL BE REFERRED TO THROUGHOUT THIS MANUAL'. 
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• = cWuTER WRITTEN 
EXECUTiGN COMMANDS: 



LPTEST INSTRS P(2) OF (9) 



TO BEGIN EXECUTION OF THE "LPTEST" f\ROGRAMMING SEQUENCE, AN . 
OPERATOR WILL PERFORM A CHAIN OF EXECUTION COMMANDS. THE j 
ACTUAL FDRM OF THESE COMMANDS WILL DEPEND ON THE TIME-SHARING 
SYSTEM BEING USED. IN GENERAL, THESE COMMANDS WILL PERFORM 
THE, FOLLOWING OPERATIONS: 

.- CALL UP "LPTEST" AND ITS THREE SUBPROGRAMS "ANDXP", 
'"ANDPX" AND ."FTEST" AND TRANSLATE THEM INTD MACHINE 

LANGUAGE. THIS IS THE COMPILATION PHASE. 
- LOAD THE COMPILED PROGRAM AND SUBPROGRAMS INTD 

THE CENTAL PROCESSING AR^A AND START PROGRAM 

RUN. THIS IS THE EXECUTION PHASE. 

DATA entry: 

(NOTE THAT ALL OPERATOR-ENTERED RESPONSES TO COMP\JTE^-WR I TTEN COM- 
MANDS ARE FOLLOWED BY A CARRIAGE RETURN.) 

THE RUN DF THE "LPTEST" PROGRAMMING SEQUENCE BEGINS WITH THE PRINT- 
OUT OF THE FOLLOWING INFORMATION: 

•THfsyPROGRAM-i ANALYZES AND COMPARES THE RETENTION TRENDS OF 2 OR MORE 
•OCCUPATIONS TO DETERMINE WHETHER THEY CAN BE: GROUPED TOGETHER FOR LOG-,. 
•PROBABILITY ANALYSIS. 

• I • 

•FOR PURPOSES OF TH I S~ program: 

• (U THE "X - VALUES" = LENGTH DF SERVICE COMPLETED; AND 

• (2) THE "Y - VALUES" = NUMBER (OR PERCENT) RETAINED AT TIME X,' . . 

• ■ • ■ . 

•SEE INSTRUCTION MANUAL FDR FURTHER EXPLANATION OF DATA REQUIRED. 

NEXT. THE COMPUTER ASKS: \ 

^ •ENTER THE NUMBER OF RETENTION GROUPS TO BE COMPARED 

THE DESIGN OF LPTEST ALLOWS THE USER TO ENTER UP TO TWENTY OCCUPA- 
TIONS AT THE BEGINNING I)F A RUN AND THEN SELECT^UT WHICH SPECIFIC ,;. 
OCCUPATIONS ARE TO BE COMPARED DURING EACH RUN /IF LPTEST S ANALY- 
SIS SEQUENCE. ANY S.UBSET OF THE INPUTTED DCCUf^ATIONS MAY. BE COM- \ 
pArED REGARDLESS OF-THE NUMBER IN THE SUBSET DR THE ORDER IN WHICH 
THE ORIGINAL OCCUPATIONS HAVE BEEN ENTERED. , 

FOR EXAMPLE, USING THE HYPOTHETICAL DATA, ALL THREE DF THE SAMPLE 
OCCUPATIONS CAN BE ENTERED INTD LPTEST ATCONCE. THENTHE RESPONSE 
TO THIS COMMAND WOULD BE "3". OR, IF THE OPERATOR. SO DES IRES , ONLY 
SOME OF THE OCCUPATIONS FOR WHICH DATA AR& AVAILABLE NEED BE. ENTERED. . 
THUS, IF ONLY 322 AND 318 ARE TO BE' ENTERED li^NO- COMPARED , THEN THE; ^: .S, 
RESPONSE TO THE COMMAND IS "2". ^ Z' > • 

EACH OF THE ENTERED'GROUPS SHQtjLD'ftAVE' THE . SAME )iu,HBER 0F» KNOWN RE- 
TENTION POINTS AT THE SAME TIME VALUES. 

r ^ . - 256 • . --^ 

:.■ • ■ ' ■ ■■ - ■ f- ■ 



• = COMPUTER WRITTEN 
! = OPERATOR ENTERED 



LPTEST INSTR: P(3J OF (9) 



THE COMPUTER THEN ASKS FOR EACH OCCUPATION CODE IN TURN AND THE 
OPERATOR ENTERS THE CORRECT CODES.. "FOR EXAMPLE, IF THE DATA FOR 
.322, 318 AND 201 ARE TO BE ENTERED, THE SEQUENCE IS AS FOLLOWS: 

♦ENTER THE OCCUPATION CODE FOR GROUP . " 

*N0. 1 : . 
•322 

* ^ 

*N0. 2: . . , 

!318, 

* ■■ 

*N0. 3: 

•201 ■ ' 

(IF. THE GROUPS BEING COMPARED ARE EITHER SUBSETS OF ONE OCCUPATION 
OR ALREADY-GROUPED OCCUPATIONS, THEN EACH SUCH GROUP MUST BE GIVEN 
US OWN NUMERIC CODE OF UP TO FIVE DIGITS.) 

NEXT, THE COMPUTER ASKS: ' 

♦ENTER. THE NO. OF X-VALUES 

THE . INPUTTED VALUE AT THIS POINT IS THE NUMBER OF RETENTION 
POINTS FOR WHICH DATA ARE AVAILABLE. FOR EXAMPLE, IF THREE 
YEARS OF DATA ARE AVAILABLE ,ArHE RESPONSE IS "3*. (NOTE - ^ 
THERE MUST .BE AT LEAST TWO X-VALUES.) " 

NEXT, T»^E COMPUTER ASKS FOR EACH X-VALUE IN TURN. THE OPER- 
ATOR INPUTS ETVCH TIME VALUE (INCLUDING AN AVERAGING FACTOR, 
ANY). USING THE HYPOTHETICAL DATA (WHO§E AVERAGING FAC- 
TTDR JS 0.5), THE PROPER X-VALUES FOR 1973," \37'i AND 1975 7 
Are 1.5, 2.5 AND 3.5, RESPECTIVELY. THE INPUT SEQUENCE IS 
AS follows: 

♦ENTER X-VALUE, 
* 

*NCf. 1 : 

!1.5 
* 

'*N0. 2: 
12.5 
* 



•NO. . 3: • • vvV^:?<Ms^'1 




FOLLOWING THESE E'NtR lES ,. ^HE COMPUTER ASKS FOR THE NUMBER OF 
EMPLOYEES WHO COMPOSED THE ORIGINAL GROUPS OF HIRES FOR THE 
OCCUPATIONS BEING COMPARED. THESE ENTRIES ARE REQUESTED IN 
THE SAME ORDER' AS THE OCCUPATION CODES WERE ENTERED ABOVE. 
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* = COMPUTER WRITTEN 
1 = OPERATOR ENTERED 



I. 



LPTEsf/rtJSTRS PC*) OF (9) 



-FOR EXAMPLE, IF DATA FOR ALL OF THE OCCUPATIONS COMPRISING THE HY- 
POTHETICAL DATA SETS ARE TO BE ENTERED AT THE OUTSET .. THEN . THE START 
ING GROUf DATA WOULD. BE INPUT AS FOLLOWS: 



.*ENTER STARTING POPULATION (N) FOR GROUP 
* 

*N0. 

! no 

* 

*N0. 

!125 
* 

*N0. 

•2 70 



NEXT, 

*ARE Y-VALUES IN 

* (.1) NUMBER OR 

* (2) PERCENT FORM? 

* (ANS 1 OR 2) V 



1 : 


( 


322) 


2: 


' ( 


318) 


3- 


( 


201 ) 



IF THE RETENTION DATA ARE IN THE FORM "NUMBER OF EMPLOYEES 
RETAINED FROM THE ORIGINAL GROUP-, THEN THE RESPONSE TO ' 
TH£ ABOVE COMMAND IS "1". 

ON THE OTHER HAND, IF THE DATA ARE IN THE FORM -PERCENT OF 
EMPLOYEES RETAINED FROM THE ORIGINAL GROUp-, THEN THE RE-.. 
SPONSE IS -2". ' . 

IF THE Y-VALUES ARE IN "NUMbER- FORM, THEN THE COMPUTER 
types: 

*IN INTEGER FQRM (NU DEC. PIS.). 
*ANO SEPARATE^D BY COMMAS, ENTER THE 

•Y-VALUES., CORRESPONDING TO X = - 

THE CbMPU#R:^TH£N INDIVIDUALLY PRINTS OUT 'EACH OF THE INPUTTED 
X-VALUES. 'fWE OPERATOR THEN TYPES IN EACH 7-VALOE ASSOCIATED 
WITH THAT X-VAtUE. THE Y-VALUES ARE ENTERED IN THE, SAME OR- 
DER AS THE' bcC-UPATION CODES WERE ENTERED ABOVE. THESE NUMBERS 
ARE ENTERED ON CNE LINE AND SEPARATED B7 COMMAS.'; FOR EXAMPLE, 
USING THE HYPOTHETICAL DATA. I'' THERE ARE THREE^X-VALUES (1.5. 
2 5 AND 3.5) AND THREE OCCUPATIONS (322, 318 AND 201 , IN THAT OR- 
DER), THEN THE INPUT SEQUENCERS AS FOLLOWS: 

* 1.5000: 
•58,77,225 

* • ■ i 

* 2.5000: \ 
•'i^*,. 62, 201 
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LPTEST INSTR: P(5) OF (9) 

• = COMPUTER WRITTEN' 
! = OPERATOR ENTEgEO 

* 3.5M0: , ' \ 

136.53,190*' , ^ 

IF THE Y-VALUES ARESSfN "PERCENT" FORM, THEN THE COMPUTER TYPES: 



*IN DECIMAL FORM, SEPARATED BY COMMAS, 

*ENTER THE Y-VALUES CORRESPONDHJG TO X =^ X 

USING THE SAME PROCESS AS ABOVE,. THE COMPUTER PRINTS OUT THE IN- 
DIVIDUAL X-VALUES. HOWEVER, THIS TIME THE OPERATOR ENTERS THE 
PERCENTAGE VALUES IN DECIMAL' FORM. (E.G.* 52.73% IS ENTERED AS 
.5273). THESE VALUES ARE ENTERED ON A SINGLE LINE, SEPARATED BY 
COMMAS, AND IN THE SAME ORDER AS THE OCCUPATION CODES WERE EN- • 
TERED ABOVE. FOR EXAMPLE, IF ALL THREE HYPOTHETICAL DATA SETS 
WERE BEING ENTERED (IN THE ORDER: 322, 318, 201 J, T-HE INPUT SE- 
QUENCE WOULD be: V . \. ' 



* 1.^5000: 

!. 5273, .6160,. 8333 \ 

* ■.. ■ „ 

* 2.5000: 

!,i»000,.l»960,.7AA4 / 



/ 



* 3; 5000: 

!.3273,.<»2'»0,*7037 



■ . <»>s 

• DURING THE NEXT SECTION 6F DATA ENTRY, THEf'oPERATOR CHOOSES 

WHICH OF THE INPUTTED GROUPS ARE TO BE COMPARE &. . THE FIRST 

QUESTION ASKED BY THE COMPUTER IN THIS SECTION IS: 

*DQ' YOU WISH TO TEST (O ALL OR (2) SOME OF THESE GROUPS? 
*(ANS 1 OR 2) 

■ , ■ -/ 

*IF, AT THIS TIME, ALL OF THE INPUTTED GROUPS ARE TO BE COM- 
PARED, THEN THE ANSWER TQ THIS QUESTION IS "1". LPTEST THEN 
COMPARES THE RETENTION TRENDS OF ALL OF THE OCCUPATIONS WHICH 
HAVE BEEN ENTERED TO DETERMINE WHETHER OR NOT THE ENTIRE SET 
OF OCCUPATIONS MAY BE GROUPED TOGETHER. 

IF T-HE OPERATOR WISHES TO , SELECT' OUT CERTAIN OF THE INPUTTED 
OCCUPATIONS FOR TESTING, THEN'THE ' RESPONSE TO THIS QUESTION 
IS "2". THEN THE COMPUTER WILL ASK: 

*HOW MANY GROUPS DO YOU WISH TO TEST? 

T«E OPERATOR^ THEN ENTERS THE NUMBER OF OCCUPATIONS THAT-ARE 
TO BE TESTED AT THIS TIME. FOR EXAMPLE, USING 'iTaHE HYPOTHET- 
ICAlVdATA, if THE OPERATOR WISHED TO COMPARE.32:2 AND 318, 
THENVTHE RESPONSE TO THIS QUESTION IS "2". 
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* = COMPUTER WRITTEN 



LPTEST INSTRS P(6) OF (9) 



TO FIND WHICH OCCUPATIONS ARE TO BE TESTED, THE COMPUTER ASKSS 

*WHICH ONES? ENTER USING THE GROUP NUMBERS 
♦ESTABLISHED DURING THE ENTRY OF OCCUPATION CODES. 
•(SEPARATE THES£ VALUES WITH COMMAS.) 

THE OPERATOR THEN ENTERS THE OCCUPATIONS TO Bt TESTED USING 
THE NUMBERS ASSIGNED TO THE INPUTTED GROUPS BY LPTEST DURING 
THE ENTRY OF OCCUPATION CODES. FOR THE. HYPOTHETICAL DATA, 
322' IS GROUP 318 IS ^UP ^2 AND 201 IS ^GROUP iJ3. SO, 

FOR EXAMPLE, IF 3 22 AND. ^18 ARE TO BE COMPARED THEN THE OPER- 
ATOR WOULD ENTER "1,2". THESE VALUES ARE ENTERED ON ONE LINE 
AND SEPARATED BY COMMAS. THE NUMBER OF VALUES. IN THE LINE 
MUST EQUAL THE NUMBER OF GROUPS TO BE TESTEO. . 

ANALYSIS outputs: 

AFTER ALL OF THE REQUIRED DATA HAVE BfetN- ENTEREO, T«E COMPUTER 
THEN PERFORMS ALL OF THE ANALYSIS NECESSARY TO DETERMINE WHETHER 
OR NOT THE TESTEO OCCUPATIONS CAN BE GROUPED TOGETHER. THE NEXT 
STEP IS THE PRINTOUT OF RESULTS. 

THE "LPTEST ANALYSIS OUTPUT" IS A COMPLETELY LABELED PRINTOUT 
WHICH CONSISTS of: 

(DA "STATEMENT OF GROUPING" WHICH TELLS THE OPER-' ' 
iSJOR WHETHER OR NOT THE TESTED OCCUPATIONS OR 
GROUPS MAY BE COMBINED; 

(2) A TABLE OF ." I NO I V I DUAL SUBGROUP DATA" SHOWING,- 
FOR EACH .GROUP tested: 

(A) THE OCCUPATION CODE,- 

(B) THt'NUMBER IN THE STARTING GROUP, -AND . 

(C) THE A-' AND B- VALUES OF EACH GROUP'S 
LOG-PROBABILITY EQUATION; AND 



(3) If the TESTEO OCCUPATIONS MAY. BE COMBINED, THE 
A- ANO B- VALUES OF THE L()G-PRdBABIL ITY EQUA- 
' TION FOR THE COMBINED (OR SUMMED) GROUP. 

E^ACH ANALYSIS OUPPUT Ll;STS THE OCCUPATIONS OR GROUPS WHICH WERE 
TESTEO DURING THE CURRENT RUN. 

FOR^EXAMPLE, IF ALL THREE OCCUPATIONS IN THE HYPOTHETICAL DATA 
SEfWERE TESTEO, THEN THE RESULT WOOLD LOOK LIKE THI??: 



* -COMPUTER WRITTEN 



LPTEST INSTR: P(7) OF (9) 



xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 



\ 



X 
X 

X 
X 
X 
X 
X 
X 
X 
X 
X 
X' 
X 
X 
X 
X 
X 
X 
X 
X 

y 



r-LPTEST AWALYSrs OUTPUT--? 



THE 



3 OCCUPATIONS TESTED 
322 
318 

201 , 
MAY NOT BE GROUPED TOGETHER. 



INDIVIDUAL. Sl>aGROUP DATAJ 



OCCN 



322 
.318 
201 



--L - P EQUATION— 
(A) (B) 



1 10 
12^ 
270 



0.31322 
0.52560 
1. 16436 



-U40587 
-^1 .32681 
-1. 19286 



X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 



X . 5^ . X 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 



SINCE THESE THREE OCCUPATIONS CANNOT BE GROUPED, A LOGICAL NEXT 
STEP WOUILD BE- TO COMPARE 322 AND 318 (THE CLERICAL OCCUPATIONS). 
THE RESULT OF SUCH A COMPARISON WOULD BE: 

XXXXXXXXXXXXXXXXXXKXXXX5(:XXXXXXXXXXXXXXXXXX 



X 
X 

X ' LPTEST 

X* ^ 



ANALYSIS OUTPUT- 



X 
X 
X 
X 
X 
X 

' X 
X 
X 

' X 
X 
X 
X 
X 
I X 



THE 



2 OCCUPATIONS TESTED 
■ 322 
318 

MAY BE GROUPED TOGETHER. 



INDIVIDUAL SUBGROUP DATA: 



.as 



OCCN 



322 
318 



-L - P 
(A) 



EQUATION-- 
(B) ' 



110 
125 



0.31322 
0.52560 



-1.40587 
-1.32681 



X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
)( 
X 
X 
X 
X 



■ 4> 
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= COMPUTER WRITTEN 



* 

* ' 

* 
* 
* 



LPTEST oINSTR: P(8) OF (.9) 



X ' 

X TOTAL GROUP EQUATION: , 

X . 

X Y = 0.'»2320 - 1 .35371X 

X 
X 



X 
X 
X 
X 
X 
X 
X 



XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 



AS YOU CAN SEE. .HIS OUTPUT CONTA/NS ONLY DATA FOR 322 AND 318 
AND, SINCEl THESE TWO OCCUPAT lONS/WERE FOUi^D. TO-BE COMPATIBLE, 
THE*"TOTAL GROUP EQUATION". 



Irecycling; ' . 

after printing this output^ the computer asks: 

*D0 YOU WISH TO TEST ANOTHER SUBSET OF' THESE GROUPS? (Y OR N) 

IF THE OPERATOR WISHES -TO TEST ANOTHER SET OF GROUPS FROM THOSE 
ALREADY-INPUTTED, THEN THE RESPONSE TO THIS QUESTION IS "Y". 
WITH THIS RESPONSE THE COMPUTER ' RECYCLES TO THE POINT WHERE 
'§ROUP SELECTION TAKES PLACE. 

IF THE RESPONSE IS "N", THEN THE COMPUTER ASi$:S: 

*D0 YOU WISH TO TEST ANOTHER SET OF GROUPS? (Y OR N) 

IF THERE IS ANOTHER SET OF GROUPS OR OCCUPATION^ TO BE' ENTERED 
AND TESTED, THEN THE RESPONSE TO THIS QUESTION IS "Y" AND THE 
PROGRAM WILL RECYCLE BACKTO THE BEGINNING. IF NOT,. THE RE- 
SPONSE IS "N" AND THE PROGRAM RU^ENDS. - 



GROUPING occupations: 

"COMPATIBLE** OCCUPAT IONS'mAY BE GRDUPED TOGETHER AND INPUTTED 
INTO "LPTEST". THIS IS DONE BY SUMMING THE Y-VALUES (NUMBER 

Detained FOR EACH x-value. the -starting group" figure would 

BF THE SUM OF THE INDIVIDUAL STARTING GROUPS. IF PERCENTAGE 
5al1eI ARE DES RED THE INDIVIDUAL X-VALUE SUMS CAN BE DIVID- 
eS BY tSesSmMED STARTING GROUP FIGURE. FOR EXAMPLE USING_^ 
THF HYPOTHETICAL DATA, ASSUME THAT 322 AND 318 CAN BE GROUPED 
TScEy^ERl THEN tSe sGmS FOR THE INofvipUAL, X-VALUE^-WOULD. BE: 
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LPTEST iNSTRt'^PtSi OF (9) 



. X-VALUE Y-VALUES SUM 

1.5 58+77 =135 

.2.5 = 106 

3.5 36+53 =89 

'■ ( ■ ' ■ ■■ ■ ^ • ■ 

THE STARTING GROUP SUM IS: 110+125 = 235. 

THE PERCENTAGE VALUES AREs 57.^5%, '»5.11%, 37.87%. 

THESE GROUPED FIGURES CAN BE COMPARED WITH OTHER GROUPED FIGURES 
OR WITH OTHER INDIVIDUAL OCCUPATIONS. 

THE RETENTION VALUES FOR GROUPS WHICH ARE SUBGRO'UPs/oF ONE OCCU- 
PATION ARE COLLECTED BY DIVIDING THE STARTING GROUff FOR THE OC- 
CUPATION INTO THE DESIRED SUBGROUPS (E.G";* MALE - FEMALE) AND 
FOLLOWING EACH SUBGROUP OVER TIME. THE COMBINED VALUES FOR THESE 
SUBGROUPS WOULD GIVE THE RETENTION PATTERN FOR THE WHOLE OCCUPA- 
TION. 



5) 
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APPENDIX E-3 



PROCRAf^ LISTING 



■0. Q 



STAFFING NEEDS. PLANNING COMPUTER PRLQRAM: 

: 

< ; 
♦ ' LPTEST 



8L1REAU OF POLICIES AND STANOARDS 
UNITED STATES CIVIL SERVI^CE COWISSION 
: WASHINQTGN, B,.C. 20^15 ^fj^ 
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LPTEST 

J 



THIS- PROdRAM COMPARES TWO ,0R MORE SETS OF LCNGITU 
OINAL. RETENTION OATA TO DETERMINE WHETHER ANY OR 
ALL OF THEM MA^Y BE GROUPEO TOGETHER FOR LOG-PROB- 
ABILITY ANALYSIS. IN MOST CASES, THE COMPARISONS 
WILL BE BETWEEN TWO OR MORE OCCUPATIONS ALTHOUGH. 
ITHER FACTORS MAY BE USED. 

i-ONGITUDINAL RETENTION OATA POINTS ARE ENTERED IN- 
TO THE PROGRAM BY THE USER. 

REQUIREO subprograms: ;\NDPX, ANDXP, FTEST 
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' . • ' ' ' ' PAGE 1 

LPTEST i ■ ^ o , 

DIMENSION X(20),XLOG(20) /I Y(20, 20), Y(20. 20); YSD(20;20.).NO.(ZQ^^^ 

1 ,A(2G),B(20),I0C(20),NXX('»0), . ' ) 

2 YPT(20,20),SMP(2O),SMP2(2d),NUM.(20),YD(2Q,20) 

• DATA(NXy(I),I = .l.40)/'tO*'XX'/ : 

WRI TE(5 5) ■ " - ■ • 

5 FORMAT (ioCV). IX, 'THIS PROGRAM ANALYZES AND' COMPARES THE , 

1 - RETENTION TRENDS OF 2 OR MORE'). 

6 • rtRMATOxi'OCCUPATILNS TO 'DETERMINE fHETHER THEY CAK 

1 ' GROUPED .TOGETHER FDR LOG-^). 

WRITE('5,7) ■ ,v • . 

7 • FORMATCIX, 'PROBABILITY ANALYSIS. ) - 

/ WR I T E ( 5 i 0 ) ' 

10 FdRMAT(/, IX, 'FOR PURPOSES OF THIS PR0GRAM:',/,6X,'<1)' 

1 -THEV^X - VALUES" = LENGTH OF SERVICE COMPLETED , 

• 2 '; AND") 
WR I TEfS. .11) 

, 11 F0RMAT(6X,'{2) THE. "Y - VALUES" = NUMBER (OR PERCENT)'. ,^ 

1 ' RETAINED. AT TIME X.') 
WR I T E ( 5 1 2 ) 

12 FuRMATC/, IX, 'SEE INSTRUCTION MANUAL FOR FURTHER EXPLANA' 

1 'TIdN OF DATA REQUIRED.') 

30 FORMAT(///!lx\'ENTERfTHE NO. OF RETENTION GROUPS TO BE COMPARED 

READ(5;,50)NC 
50 F0RMAT(I2) 

WRITE(5,60) 

60 FORMAT(//, IX, 'ENTER THE OCCUPATION CODE FOR GROUP ) 

DO 65 1 = 1 ,NC ' ■ '■■ J ' 

WRITE(5,l'»Q)r - 
READ(5,62a^ICC(I-). 

62 FORMAT (1 5) . ' " . ' 

65 CONTINUE ^ . ». 

WRITE (5, 70) . > 

.70 FORMAT.(/, 1X,'ENTFP THE NO. OF X-VALUES ) 

READ(5,50)NX - - 
90 F0RriAT(I2) 

WRITE(5,110) ^ 
110 FORMAT(//, IX.'FNiFn x-V/\liiF ) 

DO 180 1 = 1 ,NX 
WRI TEC 5,1 '♦0)1 
ll»0 FORMATC/, IX, 'NO. . I /" , ' ^ 

READ(5,160)X(I) - 

160 F0RMAT(F7-'*) ^ 

XLOGd ) = A1. OGlO(y( 1) ) 

180 CONTINUE 

- 200 FORMAT(//??X,^FNTER '^lAPTING POPULATION (N) FOR GROUP') 

QO 260 1=1, NC 

WRITE(5, 230) 1 , ICCd ) , 
230 FORMAT (•/", IX, 'NO.. ' ,U ' (' . I , ) ) 

READ(5,250)N0(I ) 
250 F0RMAT(I6) 
260 CONTINUE 

WRITE(5,280) j 
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LPTEST PAGE 2 

280 . FORMAT {//, IX, 'ARE Y-VALUES IN',/,4X,'( 1 ) NUMBER OR',/, 
.1 4X,*('2) PERCENT FDRm^ / ,4X ( ANS l.OR 2)') 
.READ(5,300)IP NT 

300 F0RMAT(I2) 

IF(.IPiEQ.2)G0 TO ^60 ^ 
WRITE(5,330) * 

330 FDRMAK//, IX.'IN INTEGER FORM (NO DEC. PTS.)',/,1X, 

■ 1 'AND SEPARATED BY COMMAS, ENTER THE',/, IX, 
2 'Y-.VALUES CORRESPONDING TO X =') 
DD '♦40 1 = 1 , NX - , V 

WRITE(5,360)X(I) «• 

360 F0RMAT(/,1X,F7.A,':') 

READ(5,*)(IY(I,J),J=1 ,NC) 
DO kkO k=r,NC 
^EN=NO(K) 

T Yd ,K)=IY(I,K) 

• ' Yd ,K) = Y( I ,K)/EN*• 
(.YSD( I ,K)=ANDPX( Y( I ,K) ) 

i»i»0 ✓JCONTINUE 

GO TO 580 , 

k60 *WRITE(5,470) 

470. FORMAT(//, 1X,'IN DECIMAL FORM, SEPARATED BY COMMAS',/, 

V IX, 'ENTER ?HE Y-VALUES CORRESPONDING TO X =' ) 

DD 570 1 = 1 , NX .. ' 

WRITE(5,500)'X(I) ' 
500 FORMATX/, 1X,F7.4,':') 

READ(5,*)(Y(I,J),J=1,NC) 

DD 570 K=l ,NC 

EN=NO(K) 
. YSD( I ,K) = ANDPX( Yd .K ) ) 
570 CONTINUE ^ 

580 WRITE(5,585) ^ 

585 FORMAT(//, 1X,'D0 YOU WISH TC TEST (1) ALL OR (2) ' 

1 'SOME OF THESF GROiir'^? ./.tX.'CANS ) OR..^)') 
READ(5,300)MS ' t"\ 

IF(MS.EQ.2)G0 TO "-'^^ 
DO 590 10= 1 .'T • 
590 NUM(IQ)=IQ 
NG=NC 
, GO TC 6)0 
.595 WRITE(5,600) 

600 FORMAT(//, )X, '"^'W ORO'if- pn vny wi<;i( in ir<;r7 ) 

READ(5,300)Nn / 
WRITE ( 5, 605) ■> 

60<i FORMAK//, IX, 'WHICH ONES? fNTER U-^ING THE GHOUP Niin 

1 "BERS',/,)X, EST/iBl.ISi"" f^i'f'IM'. TMF TNIRY Of 

2 ' OCCUPATION CODF-^.') 
WRITE(5,607) - 

607 FORMATdX.'CSEPonftTE THE"'' v/Mur<; with rnfiMAS . ) ) 

READ(5,*) tNUM( I ) . T i.'iG) 
6)0 ENTOT=0.0 / 

NSUM=NC-»') 

00 630 1 = ) .NT, 

LR=NUM(I) 

ENO=NO(LR) . ■ r 
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LPTEST 

630 ENTOT=ENTOT+ENO \ 

NO(NSUM)=ENtOT " , ^ 

DO 710 1=1, NX 
Yd ,NSUM) = d.O 

DO 680" J=1 ,NG ' 

LQ=NliM( J) • 

ENO=NQ(LQ) 

YD(I,LQ)=Y(I,LQ)*ENO 
. Yd ,NSUM) = y(I,NSUM)+Y6(I,LQ) 

680 CONTINUE 

Yd,NSUM) = Y(I,NSUM)/ENTOT 
YSD(I,NSUM)=ANDPX(Y.(I ,NSUM)) 

710 CONTINUE 
KS=1 

720 LS=>1UM(KS) 
730 SUMX-0.0 

SUMY=0.0' 

SUM^Y=0.0 

SUMX2=^.0 

DO 820 1=1 , NX 

SUMX=SUMX+XLGG(I ) 

SUMY=SUMY+YSD(I ,LS) 

SUMXY=SUMXY+XLCG(I )*YSD(I ,LS) 

SUMX2=SUMX2+XLQG(I )**2 

820 CONTINUE . 

ENI=NX 

• S1=ENI*SUMX2-SUKX*SUMX 
S3=ENl*SUMXY-SUMX*SUMY 
B(KS)=S3/S1 

A(KS) = SUMY/ENI-(8(KS)*SUMX)^/ENI 
IF(KS.EQ.NSUM)GG TO 885 
KS=KS+1 

IF(K.S.LE.NG)GO TO 720 
KS=NSUM 
. LS=NSUM 
GO TO 730 
885 JP=0 

DO 890 MM=f,NX 
DO 890 IK=1.NG 
LL=NUM(IK) 

YPT(MM.LL)=ANDXP( YSD(MM,LL) ) 
890 YPT(MM.LL)^YPTnif'.i I )*ino, 

DO 895 IG-K'i^- 
JL=NUM(IG) 
SMP(-JL) = 0.0 
SMP2( JL)=0.0 

DO 895 KL=1,NX 
SMP( JL)=SMP(JL)+YPT(KL.JL) 
SMP2( JL) = SMP?( Jl ) -YPT (Kl. JL )**2 
895 CONTINUE 
SUMP=0.0 
SUMP 1=0.0 
SUMP2=0.0 
DO 900 KA=1,NG 
KB=NUMtKA) 
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910 
999 
1 000 

1010 



1020 

1030 

]OhO 
1 050 

1060 

1070 

1080 

1090 
1 100 

1110 
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LRJEST 

: . sump=sump+smNkb) 

sump2=sump2+smp2(kb) / 

SUMP1=SUMP1+SMP(KB)**2 
PC2=SUHP**2 
ENI=NX*NG _ . 

ENK=NX ■ ■ 

EN2=NG*(NX-1 ) 
EN3=NG-1 

VAM=SUMP2-SUMP1/ENK 
VAB=SUMP 1 /ENK-PC2/EN 1 

VARM=VAM/EN2 ' ' 

VARB=VAB/EN3 

F.2=VARB/VARM . 
NGMS=NX 
NBAS=NX*NG 

1F(F2.LE. 1'. )G0 TO 910 ' , 

CALU FTEST(NGMS,NBAS,F2,EP) 
IF(EP.LE.0.05)JP=1 
GO TO 999 ' 
EP=1. 

WRITE(5, 1000)(NXX(I ),I=1 ,21 ) 

FORMAT(/////, 1X,21A2) _ # 

WRITE(5,1010J " 
X.'X'.'^OX.'X"). 

1010) - 

1020) ^ 

X.'X -LPTEST ANALYSIS OUTPUT ' 

) ■ 
1010) 
1010). 
1030)NG 

X.'X THE —.12,' OCCUPATIONS TESTED' , 1 3X , 'X' ) 
1=1, NG 

) • • ■ 

10'»0)I0C(LU) 
X.'X'.SX, I 5,29X,''X'') 



FORMAT ( 
WRITE (5 
WRI TE(5 
WRI TE(5 
FORMAT ( 
WRITE (5 
WRITE(5 
FORMAT ( 
WRIT£(5 
FORMAT ( 
1 ^X," 
WR IT E ( 5 
FORMAT( 
1 '♦X.'X") 
WRITE(5, 1010) 



FORMAT ( 
WRITERS 
■WRITE (5 
FORMAT( 
1 ' X 
WRITE(5 
WRITE(5 
WRITE(5 
FORMAT ( 
DO 1050 
LU=NUM( 
WRITE (5 
FORMAT ( 
CONTINUE 

IF( JP.EQ.O)WRITE( 5, 1060) 
FORMATdX.'X MAY BE GROUPED TOGETHER, 
IF(JP.EQ. 1 )WRI TE(S, 1070) 



■, UX.'X") 



X, X 

1010) 
1010) 
1080) 
X.'X 
1010) 
1090) 
X , 'X' 
1 1 00 ) 
X.'X' 
') 

1110) 
X f X , 6> X , 



MAY NOT BF nnnijPFO TOGhTHER,'', lOX^'x") 



20X,''' -I P fOUAT I0N--''',2Xp"X'') 
6X . "OCCN ^4 X , " . 8X , ' ( A ) ' , 7X , " ( B ) ^ , 
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fr''^'''" V '•• i' ■ ■■ ■■ : - .' ■ ' .• ■ • .■ - 

v^''" f .'OO' ■v'l3G'--M^l,NG ■. ' ■ • :• ' , 

l:.^,"v. . . ■ •■ . JM=NUM(«) ■ , 

. . - ■ . JQC=IOC(JM) ' . 

. JN=NO(JM) ^ . 

, . .WRiTE(5, 1120) JOC,JN,A(M),B(M) . 

\\20 FORMAT(lX,'X',5X,I5,2X,I5,3.X,F8.5,2X,F8.5.2X,'X') 
. 1130 ^CONTINUE \ . , ' . ' 

IF( JP^EQ.l )G0 TO 1154-^ 
. : WRITE(5,1010) « .. 

WRITE(5, 1010) 

WRITE(5,1 140) . 
• iri»0 FORMATCIX.'X TOTAL GROUP EQUATI ON s' , 17X , 'X') 
WRITE(5,1010) 
B(NSUM)=ABSIB(NSUM)) 
WRITE(5,1 r50)A(NSUM) ,B(NSUM) 
1150 FORMAT( 1X,'X',6^<;'Y - ',F8.5,' - ',F8.5,'X', 10X,'.X') 

1,155 . WRITE(5, 1010) 

WRITE (5, 1010) ■ 
1160 WRITE(5,1165)(NXX.(I).I=1.21) . . ' . 

1 165 F0BMAT(1X,21A2) 

WRITE(5, 1 1 70) 

n/O TORMAT(////////. IX, 'DO YOU WISH TO TEST ANOTHER 
1 'SUBSET OF THESE GROUPS? (Y OR N)') 

READ(5, 1 180)LAG - 
1180 . FORMAT (A2) 

IF(LAG.EQ.'Y')GO TO 580 
, WRITE(5,1190) V. 
1190 'I, FORMAT(//, IX.'DO YOU WISH TG TEST ANOTHER SET 01% 
- GROUPS? (Y OR N)') 
/ . READ(5, 1 180)NAG 

IF(NAG.EQ.''Y'')GO TO 20 
/ STOP 
/ END 
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i^'^i SU^^ FTEST(NGMS,NBAS,F,P) ' . nr run ' 

-C '"- ... APPROXIMATES THE PROBABILITY OF CHANCE OCCURRENCE OF T^^^ . . 

?^C-. . 4 OBSERVED VAl:UE OF F. GIVEN NGMS (= N OF GROUP TESTED -- BY 

d "DEFINrTION^ THE GREATER MEAN SQUARE) AND NBAS (= N OF THE 

C " - • ' BASE GROUP). 

. C, V REQUIRES SUBPROGRAMS FUNCTION ANDXP. , > . „ „, -, ^ 

C i'^ - (REFERENCE: NBS, ABRAMOWITZ & STEGUN.AMS 55 , 9TH, 26 . 6. 1 5 ,P . 947.. ) 

. ■. , • V-i=NGMS- 1 ' -V,.";^. 

■ r:- •"y2=:NB/Ss-i • 

EX=1./3. . 
F3=F**(EXK 
F23=F3;**2 \ 
B1=2./(9.*V1 ) 
,B2=2./(9.*V2) 

'Tl = l.-B2 r 
T2=l.-Bl 

T0P=F3*T1-T2 • 
B0T=SQRT(B1+F23*B2) 
A X=T0P/B0T • . . 

Q=ANDXP(X) 
. . P=l ;-Q ^ 
RETURN 

. END • . 
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ANOPX PAGE 

C ACCUMULATtvE^iSoRMAL DISTRIBUTION FUNCTION. ■^^ ^ 

C APpSmaTES ^ A* or - .000004) FROM P. (HASTINGS, P» 92 > 

- SATAni:A2:Bl.B2,B3/2. 515517.. 802853. .010328. 1.432788. 

V - 1 ,189269.-001308/ 
IFCP.GE.-i.) Xl=4. 
IF(P.GE.1.) GO TO 380 ' 

IF(P.GT. 0.0005) GO TO 70 ! 
Xl='4. , 

GO TO^ 380 

70 IF (P.LT.0.5) GO TO 110 . 

. E.= SQRT(ALOG ( 1 . / ( 1 . -P )**2 ) ) ■ ^ „ . , o 

XI = E-((A2*E+A1)*E+A0)/(((B3*E+B2)*E+BI)*E+1.) 

GO TD 140 

no P = l.-P X 

E = SQRT(AL0G(1./(V.-P)**2)) . . ^ 

XI = -1.*(E-((A2*E+A1)*E+A0)/(((B3*E+B2)*E+B1)*E+1.),) 





. P = 


l.-P 




140 


AX1 


= XI +0.0005 






AX2 


= XI -0.0005 






API 


= ANDXP(AXl) 






AP2 


= ANDXP(AX2) 






AI 


= (P-AP2)/(AP1 


-AP2) 




BXl 


= AX2+AI*(AX1 


-AX2) 




Dl 


= P-ANDXP(BXl) 





AX3 = BXl+1. 1*D1 
AX4 = BX 1-1.1*01 

' AP3 =. ANDXP(AX3) 

AP4 = ANDXP(AX4)- 

AII = (P-AP4)/(AP3-AP4) 

BXl = AX4+AII*(AX3-AX4) 

Dl = P-ANDXP(BX1 ) :, 
290 IF (Dl .LE. 0.0000005) GO TO 370 

IF (Dl .GT. 0.0000005) GO TO 340 

BXl = BX1+0.0000O02 

Dl = ANDXP(BXI) ( 

Gd TO 290 
340 BXl = BXl-0. 0000007 

01 = ANDXPC?^' ) 

GO 'to 290 
370 ANDPX = BV1 

GO TO 39^ 
380 ANDPX=vi 
390 RETURN 

END 
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fUNCTIdH ANDXP(X) . 

ACCUMULATIVE NORMAL DISTRIBUTION FUNCTION. APPROXIMATES P 

FROM X (TO + OR - .0000001). (NBS-55, P. 932) 

Xi = ABS,(X) ' 

T = V./O . + .23l6iH9*Xl ) i 

ANDXP = l.-.3989'»23*EXP(-(Xl**2)/2.)*( .3193615*T-.35656^*T** 

1 +1 .78U78*T**3-1 .82125^*T**'» + 1 .33027^*T**5) 

IF (X.LT.0.0) ANDXP = l.-ANDXP 

RETURN 

END ^ 



- / 
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'STAFFING NEEDS PLANNING COMPUTER PRC.GRAi^: 

GS810 „ - - 



BUREAU or POLICIES AND STANDARDS 
UNITED STATES CIVIL SERVICE COMMISSION 
- WASHINGTON, D.C. 2Qk\S 
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* . GS810 TECHNICAL ANALYSIS / 
Purpoise : . ' 

Computer terminal program 6S810 is designed 1^ enable personnel workers 
without extensive statistical l>ackground to: ** 

(1) Mak^ detailed quantitative analyses and projections of the 
grade-advancement patterns of individual occupations of 
employee groups for use in recruitment and counselling 
programs, workforce and budget studies, and organization 
staffing needs planning programs; 

(2) Make objective comparisons of the grade/time pattern of one 
occupation or employee group with that of another , for "use . 

in equal employment opportunity, career planning and in-service 
placement programs; and . 

(3) Identify by objective means, for purposes Of executive develop- 
ment, performance evaluation, and^occupational standards study 
programs , those individuals or subgroups within an occupation 
or organization whose advancement trends have been ^ignf icantly 
above or aignficantljr below the prevailing norm for this group. 

In accordance, with these purposes, GS810 has been designed (with its 
companion subprograms )'^as a comprehensive and is^^lf-contained unit which 
will do the complete job of . turning raw input data into the finished 
output projections and evaluations desired by the program operator without 
any need ^for operator technical participation and without any need for 
operator reference to outside assistance sources (statistics, texts , look- 
up tables, etc.) in order to evaluate, the significance of, calculation 
results.- , H"" - 

Method / • 

Input Files - Data are input to the program from previously stored 8 X 10 
integer files which are accessed by means of a 6-space file-name code 
(one code system is explained in the operation mainual)^ By this means, 
the operator need do no data entry: only type in the code name of 
the (preViously-entered ai^- -verified) files chosen for study and 
they are. automatically read into the p^rogram. 

Thife technique is extremely flexible since it frees the operator tjc^^ 
select and combine any flies (up to a maximum of 8) and in any^^order 
or combination. The technique is. also extremely' reliable because all 
data items in each file are brought into the program with 100% accu- 
racy; i.e.V without the inevitable errors of manual entry. And, of> 
course, this technique is far more efficient in the accessing and , 
manipulation of files than is the case with laborious manual data- 
entry methods. * . 
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Files are in integer format for maximum ease and accuracy of data entry 
and for minimum-space storage. Since people-count : tables , characteris-. 
tically, 'involve no decimal fractions, integer format (no decimal points 
specified) is (a) sufficient, (b) saves both the work and the inevitable 
errors of entering decimal points, and (c) conserves one digit, of 
Storage space for every number entered. - . 

<Dat a Arrangement - The program is formatted for 8-grade by 10-year grade 
liraiiribution tables, ('^8 x 10 arrays** in computer usage). An 8-grade 

■pattern was selected to provide full coverage of the eight grade -advancement 
steps from grade 5 to 15* in the General Schedule pattern. Any other, 
uniform- step., equal-interval progression pattern desired, however, whether 
involving grades or, in the alternative, salary categories, can be entered 
by the operator. If less than 8 grades (rows) of data are tg be used, 
the unused rows in the files should simply be filled with zeroes 
(0,0,0. ..etc.) 

A »'f irst-lO-year " pattern was selected for the length-of -service 
(columnar) dimension for several reasons: ^ 

(1) The most rapid grade rise in any group occurs in the first 
10 years of service; thus the cise of the group's character- 
istic advancement curve is largest and, hence, most accurately 
measured in this initial period. 

(2) This span^^covers those length-of -service groups which are the 
nufnerically largest in most organizations: groups whijCh are 
of special interest in both analysis and projection becaus^ 
of their importance for the or ganization * s . expected future.^ 
And / 

nv TTie span of the la^t ten years encompasses most or all of the' 
p,eriod of our most intencive national efforts toward equal, 
employmenf opportunity' data for this period tl^us constitute 
th'=' clpar9Rt available test of bpth (a) the objective results 
of rpce^^t ^Mr'ng and pfomot'on policies, and (^0 the probable 
roejiilto t'^ hrs o<»#p*.'tof1 <i o^i <-!>^«?rk pr>lirip«5 for the futvire. 

As with uni)9^^^ ro^^p, ap mpnfion«-rl above, if ' psr th"n 10 years* data 
are avnil^ble (a' t o pre ■■ust), the -nuse'^ file columfis shonl'l 



*So that grade adv?»ncement. ^urve^ in t?he ^^two-grade-i nterval" 
occupations follow the s^me "one digit^one step" irate be^ow GS^ll 
that they do above GS-11 (which is essential for curve computation 
purposes) the twordi git- inter val grades of GS-5,7, and 9 are Converted 
to Rrade weights of "8'\ anH '•1.0'» respectively. , 
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Other than whole - or half-year8r;.are desired (e.,g.;, 'where there are 
/irregular time intervals b^tv^elj^columns) , the desired fe,ol\iiiia-v^aluea V 
can be entered by the opera tpri^* 

Trend Analys is - Since grade-advancement trends are ris^ig curves Whose 
rate of increase slows rapidly over time, the appropriate curve, form, 
for such trends is one of the family of ''growth" or ''meftiira tion" curves,. 
The two most widely-useds^ in economics and population studies are the 
Pearl-Reed Logistic and the Qompertz: ^ 

(1) 1/Y = a+bc^ /Logistic) ^ , ^ 

(2) log Y = loga + logb(c^) (Goinpertz) 

' ■■ - ^ > ■ ■ , ~ / ' ■ : 

The* specially useful feature of these familiar curves, of cotirse, is 
^he inclusion," among the multiplicative . terms , of the additive term, 
"a." From this term the curve asymptote--^the maximiSm ultimate value 
toward which the curve is tending as thie increasing value of x makes 
c^ approach 0*--is easily determined. It is thid additive character 
of this term, however, which makes it possible to ,fit* such curves to 
.ctual data only by approximate methods, and even then, only I/- 

(a) when the total number of observations is ^ome multiple ' 
of three (6, 9, 12, etc), ' 

(b) when all time intervals betweem observations are exactly 
equal, and . ■ 

(c) when no observations in the sequence are missing.' 

In purely practical terms, then, such severely restrictive terms are, 
to say the least, not always met in the average personnel management 
"operating situation. The use of these commonly-seen curves is therefore 
inappropriate for a program intended for a wide range of applications. 

(As a- final consideration it is technically sufficient to note that 
the deviation of the observed values from the trend lines fitted by the 
approximate methods necessary for these curves 

"(a) are not minimal in size, as in least- squares fits, an^ ^ 

(b) are not equal in sum on both sides of the trend line, as 
y ' needed for reliable tests of variance.) 

A third type of maturation curve is a relative of the log-probability 
decay curve which we have established** as the exact form of the work 
group retention curve: 



X ) 

*The G term is always less than 1, thus c rapidly becomes smaller 
with increasing values of x. 

**H.L.,. Clark, "Problems and Progress in Civil Service Manpower Planning, 
NATO Conference on Manpower Planning Models, Cambridge, U.K., Sept. 1971 
' ' ^ - 283 - 
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' (3) z(y) = a + b log x 

. ■ ^ ' 

However, this curve can be fitted only to longitudinal data (where 
all data are observations of the same group at different points in 
time) and only when both the original size of ^ the group and its size^ 
at the time of each grade observation are known. 

The fourth growth curve form is tlic ^ciuciai category ot the exponential, 
which includes such forms as • ^ 

X b - 

(A) Y - ab oj. Y ax 

These curves cau be couvei ted, with Liic use of logai:ithms, to the 

linear forms: n 

(.A.^; log Y - log a X log b or ; 
(4.b) log Y = log a^b'log X 

Since such liueai tormb Jo pciuiiL the use of least-squares fitting 
techniques, they t-an^e^idiiy be a|jplied: ^ 

(i) To ciObcr^-s^cLi^.ii JaL^, as well as longitudinal data, where th^ 
observed population in one column typically may' differ con- 
siderably from the ^copulations of other cojlumns , 

U; Wheic obboi vaL iwuc* ai scpaj-aLiid by unequal time intervals, 
and 

(j; Wlieie ou^ v,i itu,, wbacL vaLlous in a series are not available 
(a missin^^ y»-^J^> -^^-) 

•V- 

Because ot Llilo iiiic^i toLu.; Ltic exL t apo 1 at ipn of such curyes is 
technically very easy. And because they are fitted by leas t^squares 
techniques, variance aiuilysis is sLt aightf orward. ' 

As a' tina; note, Llie ^loouncao with which curves of this equation form 
can be fitted to the 10-year data used in this program is statistically 
scarcely distinguishable, for moderate to small size samples, from the 
fits obtainable from the log-probability curve form* 

Computations 

GS810 performs a series of computations on the data 'input from the , 
. called files: • 

• ' \ 

(1) The files are added together to produce a tdtal grade distri- 
bution table of all entered groups, with tfie row^yalues 
(i.e., grades or weights) and the column-values (t*e., years) 
specified by the operator. 
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(2) The row-values (Y- values) and the; column- values CX-Values) are 
■' then converted to logs and a weighted linear trend line of 

form 4.b is calculated. (The coefficient of linear correlation 
(r) is calculated as a measure of the proportion of total 
variance which is accounted for the trend linq.) 

(3) The value of the regression equation is then calculated for 
each column. The squared deviation of each block of Column 
data from the regression' ^ne is determined, multiplied by 
the block frequency, and the overall »sum of variances is 
divided by the total group N-1 to produce the overall group 
variance. The program, then prints out (l) the equation, (2) 
r, (3) N,. and (4) the group variance. ^ « 

(4) At operator option, the program iterates regression line over 
35 years. 

4 

(5) At operator option, the program fits the overall group '^norm" 
*line of regression to the distribution of each subgroup a;^ 

determines the variance of the subgroup from the overalL 
"norm." This subgroup variance is then compared with the 
variance of the overall group from "the same curve and the 
^ / significance of the difference is measured. 

Significance Test ^ The significance of the ratio of the individual 
subgroup variance to the total group variance is evaluated by means of 
the F-ratio test, one of the most fundamental, reliable and flexible of 
statistical tests. 

The, value of F is given by: 

F = Variarfte of subgroup 

Variance of total group , 

* 

* ' n2 (Total gr6up degrees of freedom) = - 2 w 

* ' 

The probability - value of the resulting F-ratio is calculated by means 
of the Q(F) approximation given by Abramowitz and Stegun (26.6l.15 AMS 
' 55, 9th, 1970) and the significance of Q is evaluated with the usual 
fiduciary limit of p = .05. (The variance of the total group is used 
as the denominator in order to minimize the probability of "Beta" error: 
the apparent finding of a "difference" where in fact no difference 
exists. ) 



Sample SifeV - The use of the F-test to evaluate the results of the pro- 
gram! s calculations is one of GSSlO's key features, not only because of 
this test's well-known power and validity, but--most especially--because 
of its ability to produce Valid results from .samples smaller than those 
of almost any other statistical test : pnce the overall group "norm" 
advancement curve is known, reliable comparisons with this curve can 
be made by means of the F-test for subgroup samples as small as 3. 
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Special Program Features . 

Certain special features of GS810, in addition to its overall design 
as a completely serif-contained system, are worth special mention: 

1. The operation manual describes a file code system the operatqr 
may use in naming, storing and calling files. 

2. The program includes detailed step-by-step guidance to the 
operator in the selection of column- and row-values. 

* 3. . The program offers the operator a wide range of butput and 

analysis options, each of which will be executed automatical? 
at the operator's chpice: 

a. ) A write-out of the complete grade •distribution of the 

overall group (i.e., the sum of all entered files). 

b. ) A 35-year career-progression projection of the computed.. 

"norm" curve. * 

c. ) An F-ratio test of each subgroup relative (l) to the 

"norm" curve pattern of the entire group or (2) to the 
entered equation and variance of an gutside group. 

-d.) A write-out of the complete grade distribution -of each 
individual group entered into the program 

4- Every output or "results" section written^out by the program 

includes the full file-code name of th^^^^oup to which it referV 
(including, if the group is the sum of suljgroups , the full file- 
code name of every subgroup included the total group). 

Automatic Run Option 

When each program run-through with a group of files is completed, the 
operator who asks for another run with new files is offered the option 
of having that next run in an automatic mode: once he specifies the 
new files he wishes to analyze and the cocTe he wishes to use for the 
sum of the entered subgroups, the entire program will thereafter run 
through to. completion, with the exact same pattern of analysis and 
outputs which he. specified in hi's prievious step-by-step run-through, 
6n a cdmpletely automatic basis without further operator participation* 

On each run-through of GS810, that is, the program records ^very 
program option chosen by the operator and if no change is desired in 
this pattern, each subsequent grgup of fi^es called by the/operator will 
be processed in exactl^r the same way completely automatically. With 
^ this option, then, any desired pattern of options can be repeated with 
successive groups and combinatior^ of files withjan absolute minimum 
of operator effort and with a maximum of speed and efficiency. 
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STAFFING NEEDS PLANKING COMPUTER PRCGRAM! 

GS810 



-O- 



i 

Of POLICIES AND STANDARDS 
UNITED STATES CIVIL SERVICE COMMISSICN^ 
WASHINGTCN. D.C. 20^15 
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GS810 INSTR: P(|) OF (8) 
. (FILE CREATION) 



DATA ARE ENTERED INTd'TflE "GS810" PROGRAM FROM PREVIOUSLY-STORED 
FILES. THESE FILES CONSIST OF 8 ROWS AND 10 COLUMNS. EACH ROW 
REPRESENTS A GRADE-LEVEL; EACH COLUMN A SPECIFIC LENGTH-OF-SER- 
VICE CATEGORY. 

FOR USE WITH THE "GS810" PROGRAM, DATA MUST BE COLLECTED BY 
LENGTH-OF-SERVICE CATEGORIES. A COUNT SHOULD BE MADE OF THE 
NUMfiER OF EMPLOYEES IN EACH SUCH CATEGORY (E.G., 0- J YEARS, 
1-2 r6ARS, ETC.) FOR EACH GRADE-LEVEL (E.G., GS-5,7,9, ETC.). 
SUCH COUNTS SHOULD BE MADE FOR EACH OF THE, MAJOR. OCCUPAT lONg. 
IN AN ORGANIZATION. (THESE DATA MAY COME FROM EXISTING AUTOMATED 
SYSTEMS OR FROM ANY OTHER SOURCE AVAILABLE TO AN ORGANIZATION.) 



A SAMPLE OF SUCH A COUNT MIGHT BE: 
OCCUPATION s GS-OOXXX 



GRADE 

5 

7 

9 
1 1 
12 
13 
\k. 
15 



0-1 1-2 



3 

5 
6 



2 
2 
3 
k 



643 

J- 2 



LENGTH OF SERVICE (YEARS) 
2-3 3-4 i»-5 5-6 6-7 7-8 



8-9 9-10 



1 

2 
1 
1 
1 

2 



THIS TABLE CAN VERY SUPPLY BE CONVERTED TO A "GS810"-TYP£ FILE 
SUCH FILES CONSIST OF 8 CONSECUTIVE TYPED LINES (ONE / GRADE- 
LEVEL) OF 10 ENTRIES EACH (ONE / LENGTH-OF-SERV I CE. CATEGORY) , 
SEPARATED BY COMMAS, WITH EACH ENTRY MADE UP OF \-k INTEGERS. 
ZEROES ARE USED WHERE BLANKS OCCUR IN THE TABLE. 

A COMPUTER FILE FOB THE ABOVE TABLE WOULD BE: 

■ 3,2,0,0,0,0,0,0,0,0, 
4,2,1 ,0,0,0,0,0,0,0, 
5,3,2,1,1,0,0,0,0,0, 
6,i»,l ,0, 1 ,2,0,0,0,0, 
6,3,1,2,0,1,0,0,0,0, 
0,2,1,0,0,0,0,0,0,0, 
0,0,2,0, Is^, 0,0,0,0, 
0,0,0,0,0 ,J^«>i, 0,0 

EXECUTION COMMJANDS! 

TO BEGIN EXECUTION OF THE "GS810" PROGRAMMING SEQUENCE, AN 
OPERATOR WILL PERFORM A CHAIN OF EXECUTION COMMANDS. THE 
ACTUAL FORM OF THESE COMMANDS WilLL DEPEND ON THE TIME-SHARING 
SYSTEM BEING USED. IN GENERAL THESE COMMANDS WILL PERFORM , 
THE FOLLOWING OPERATIONS: 
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* = COMPUTER WRITTEN 



GS810 INSTR: P(2) OF (8) 
. (FILE CODE, ENTRY) ' 



CALL UP "GS810" AND ITS FOUR SUBPROGRAMS "RLL8U0", 

■VLL810", "FTEST" AND "ANOXP" AND TRANSLATE THEM INTO 

MACHINE LANGUAGE. THIS IS THE CDMPILATI'ON PHASE. 

LOAD THE CDMPILED PROGRAM ANO SUBPROGRAMS INTO 

THE CENTRAL PROCESSING AREA AND START PROGRAM 

RUN. THIS IS THE EXECUTION PHASE. • ' 



DATA entry: 

*THIS PROGRAM ANALYZES GRADE/ 
*TIME TRENDS IN 1-8 GROUPS DF 3 
*MORE EACH; USES 8-GRADE X 10- 
*YEAR DATA ARRAYS. 



SELF-EXPLANATORY 



OR 



*Et4X^f^ FiLES VIA 6-SPACE CODE (SEE INSTR). 

thA-uggested file code consists df: 



SP (1)= (M)INORITY, (N)ON-MIN, (N)Di., ETC. 

SP (2)= (M)ALE, (F)EMALE, N(D)., (T)OTAL, ETC. 
. SP (3)-(5)= GS SERIES CODE; E.G., ( 2 ) (0 ) (.1 ) ., ( 1 8 1 0=8 1 0) 
' SP (6)'= year: 7(4J,7(5),0R (0) ( = 7^+75)' 

AfiOVE SELF-EXPLANATORY. ' USER MAY CHOOSE TO USE [JIFFERENT FILE 
COOE SYSTEM. IF SO, CODE NAMES MUST CONSIST OF NO MORE THAN 6 
LETTERS AND/OR DIGITS, IN ANY DESIRED COMBINATION. 



*HOW MANY FILES DO YOU WISH TD 
•ANALYZE? (ANS^l-8) 

*PLEASE ENTER CODE NAME OF EACH 
*FILE: f 
*1 = 



*(ERROR MESSAGE) 



SELF-EXPLANATORY. 



ENTER 6-SPACE CODE NAME OF ' 
FIRST DESIRED FILE.^ FILES ' 
WILL BE NUMBERED IN'ORDER 
Of ENTRY. 

FILE-READ ERROR; RUN ABORTED 
CHECK FOR FILE-CODE TYPING 
ERROR (ESP: 0 FDR 0), -WRONG 
CODE, OR BAD FILE. TO CHECK 
FILE,. TYPE OUT THE FILE . 
AND CHECK FOR 8 LINES, 10 . 
ENTRIES; MISSING NO.'S OR 
COMMAS, OR NO.'S WITH MORE 
THAN h DIGITS. 
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GS810 INSTRS P(3) OF. (8) 
* =. COMPUTER. WRITTEN ^ (COLUMN-VALUES) 



•DATA TAKEN FROM FILES. NEXT? 
*"StO PATTERN" DESIRED? (Y OR N) - 

H THE 'Y' OPTION IS CHOSEN, THE COMPUTER ASKS: ^ 
*Cl) 5/15 OR (2) 1/8? (1 OR 2) 

THE "STD PATTERN" OPTION ALLOWS THE OPERATOR TO SET AT THE BE- 
GINNING OF THE PROGRAM CERTAIN BASIC DECISION VARIABLES TO THE 
VALUES THEY MOST FREQUENTLY HAVE. THIS OPTION PROVIDES A FASTER 
RUN THROUGH OF THE PROGRAM. (FURTHER EXPLANATION OF THE VARIABLES 
INVOLVeO IN -THIS OPTION IS FOUND ON PAGE 6.) • 

IF 'N' IS CHOSEN, THE COMPUTER ASKS: 

(A) SELECT COLUMN-VALUE PATTERN WANTED^ 

(USE COL. midpoints: "0-1"yRS= "0.5", ETC.) 

(1) 0.5,1.5, ETC. 

(2) 1.,2., ETC. 

(3) OTHER ( 1 ,2 OR 3) 

MOST APPCICATIONS 0^ THIS PROGRAM WILi> INVOLVE THE DIVIDING OF 
EMPLOYEES INTO LENGTH-OF- SERV I CE GROUPINGS (COLUMNS). SINCE 
SUCH GROUPINGS COME FROM A CONTINUOUS DISTRIBUTION, EACH COLUMN 
WILL REPRESENT A SPECIFIED RANGE OF VALUES: 0.0-0.99 YEARS OF 
SiERVICE, 1.00-1.99 YEARS, AND SO ON. FOR COMPUTATION PURPOSES, 
THAT ONE VALUE WITHIN A GIVEN COLUMN RANGE WHICH BEST REPRESENTS 
THE AVERAGE OF THOSE INCLUDED IN THE COLUMN IS ASSUMED TO BE THE 
RANGE MIOPOIWT: 0.5 FOR A 0-1 RANGE, 1.0 FOR A 0.5-1.5 RANGE, 
AND SO FORTH. IT IS THIS MIDPOINT WHICH SHOULD BE USED AS THE. 
CQLUMN VALUE HERE. 

IN SOME CASES, THE "0.5/1.5" OR THE "1./2." PATTERNS M^Y NOT BE 
appropriate: E.G., WHERE COLUMNS COVER PERIODS OTHER THAN YEARS 
OR WHERE THE INDIVIDUAL COLUMNS REPRESENT SAMPLES TAKEN AT 
IRREGULARLY-SPACED POINTS IN TIME. IN SUCH CASES, OPTION "3" 
SHOULD BE USED. WHEN "3" IS ENTERED, COMPUTER REPLIES: 

• ENTER DESIRED VALUE FOR MIDPOINT OF COL: > 

* ■ "7 . 

• .1 

* 2 

ETC. 

NOTE THAT DECIMAL POINTS MUST -- REPEAT MUST -- BE SHOWN FOR 
ALL ENTRIES- IN THIS OPTION: THEIR OMISSION W I LL ^^^RESUL T IN : 
COMPUTATION ERRORS. 

FILES MAY CONTAIN FEWER THAN 10 COLUMNS OF DATA , W I THOUT . HARM ING 
COMPUTiATION PROGRAM OR STATISTICAL TESTS; SIMPLY FILL ANY 
UNUSED COLUMNS WITH O'S (ZEROES). 
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GS810 INSTR: PCjJ OF- (8) 

(GRADE Values! 



THE '"3" StlOH MAY ALSO BE USED TO ENTER COLUMN-VALUES FOR >»C- 
PERIOOS OTHER THAN THE FIRST 10 YEARS OF SERVICE (11-20 
YEARS. 5-15, ETC.), 

NEXT : ' . • 

*' (B) SELECT GRADE PROGRESSI^ON PATTERN 

* .. (ROW-VALUE) PATTERN WANTED: 
* 

* ( 1 ) GS-5/15 

* (2) GS-1/8 

* (3) OTHER (1.2 OR 3) 



SELF-EXPLANATORY. IF THE OPERATOR ANSWERS "1", COMPUTER NOTES-* 

* PATTERN APPLIED5 ^ 

* SO THAT "2-GRADE-lNTERVAL-SERI ES" GRADEs/pROGRESS AT THE SAM^ , 

* "1-GRADE/l-STEP" RATE BELOW GS-11 AS THEtl' 00 ABOVE GS - 1 1 ,GS -5/9 

* GRAOES WERE WEIGHTED: GS-5="8". GS^7="9"\ AND GS-9="10". 

* (0.5Q, WAS ADDED TO EACH GRAOE-VALUE AS THE MIDPOINT OF THE 

* TOJif RANGE OF JOBS IN THAT GRAOE.) 

THE TREND OF GRADES OVER Tir,E CWN BE QUANTITATIVELY ANALYZEO 
WITH FULL EFFECTIVENESS ONLY WHEN THE NUMER^ICAL WEIGHTS OF 
the' GRACES ARE PROPORTIONAL TO THE ACTUAL NUMBER OF STEPS IN- 
VOLVED IN THE GRADE-ADVANCEMENT LADDER. SINCE THE "TWO-GRAOE-/ 
INTERVAL" OCCUPATIONS BY OEFINITICN PROGRESS. BY STEPS GS-5, 7( 9, 
n, 12. 13, ETC., THE REPLACEMENT OF GRADES BY WEIGHTS IN THE \G5j;5 
TO GS-9 RANGE IS ESSENTIAL TO AVOIO NUMERICAL MISREPRESENTATION >^ 
OF THE ACTUAL STRUCTURE OF THE CAREER LADDER. 

IF THE OPERATOR ANSWERS "2" TO THIS QUESTION, THE NUMERICAL 
WEIGHTS USED REFLECT THE ACTUAL NUMBER OF STEPS . I NVOLVED IN 
THE GRADE ADVANCEMENT LADDER FOR THE LOWER-GRADED OCCUPA- 
TIONS. IN THIS CASE. THE GRADES 1. 2 AND 5 - 10 REPRESENT 
HALF STEPS WHILE GRADES 3 ANO k REPRESENT FULL STEPS. THESE 
GRADES ARE WEIGHTED ACCORDINGLY. 

IN OPTION "3". THE OPERATOR ENTERS THE EXACT WEIGHTS WANTED 
ONE BY one:, 

' * 

* ENTER MIDPOINT ( E . G . . GRD . " 1 ♦ 

* ( 1 .-1 .99)="1.5") FOR row: ' 

* 1 = 

* 2 = 

(ETC.) 

AS WITH COLUMN-VALUES. DECIMAL POINTS MUST BE SHOWN WITH^ALL 
ENTRIES OR COMPUTATION ERRQf^S'wlLL RESULT. THE HIGHEST NUMBER . 
SHOULD NOTEXCEED 99.99. ' ^ ' . 
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GS810 INSTRS P(5) OF (8) 
(TESTSOUTSIDE OR SUM) 



♦TEST SUBGROUPS AGAINST: 

* . (1) THEIR OWN SUM? 

* (2) AN OUTSIDE GROUP? 

*. J 3) NO TESTS DESIRED. (1,2 OR 3) 

OPTION 2 ABOVE,- REQUIRES THAT "OUTSIDE GROUP" BE CLEARLY DEFINED*- 

'■OUTSIDE GROUP" = A GROUP WHICH (A) IS NOT INCLUDED AMONG THE 

GROUPS WHOSE FILES ARE ENTERED IN THIS 
PROGRAM, BUT (B) WHOSE "VARIANCE FROM THE 
FITTED LINE OF REGRESSION" HAS BEEN COM- 
PUTED FROM DATA WHICH INCLUDED ALL OF THE 
DATA -- REPEAT, ALL: WITHOUT ANY EXCEP- 
TION WHATSOEVER -- WHICH IS CONTAINED IN 
THE FILES ENTERED IN THIS PROGRAM. 

STRICT OBSERVANCE OF THE LETTER OF THIS REQUIREMENT IS ESSENTIAL 
TO GET VALID RESULTS. THE EXTREMELY WIDE, ALMOST UNLIMITED RANGE 
OF APPLICATION- OF THIS PROGRAM IS IN LARGE PART DUE TO THE 
EXTREMELY WIDE RANGE OF SAMPLE SIZES (DOWN TO SAMPLES AS SMALL AS 
3) OVER WHICH THE VARIANCE-RATIO ("F" STATISTIC)' CAN B& VALIDLY 
TESTED. THE USE OF THE F-TEST WITH VARIANCES COMPUTED FROM LINES 
OF REGRESSION IS ONLY VALID, HOWEVER, WHEN THE VARIANCE QF THE 
BASE (THE "OUTSIDE" GROUP) IS COMPUTED FROM A CURVE FITTED TO; 
DATA WHICH INCLUDED ALL OF THE DATA OF EACH SUBGROUP (I.E., EACH - 
SAMPLE) WHICH IS TO BE TESTED HERE. IF THESE CONDITIONS ARE ALL 
FULLY MET, AN ANSWER OF "2" WILL YIELD THIS SEQUENCE: 

♦WHAT 6-SPACE CODE NAME SHOULD WE USE FOR THE 

♦OUTSIDE (BASE) GROUP? 

* 

•PLEASE ENTER THE FOLLOWING FOR BASE' GROUP (NAME) : 
* 

* (NAME) GRADE/TIME EQUATION 

* (E.G., 1.123'»5,.123i»56): 
* 

* (NAME) N, VARIANCE (E.G.: 1 23 , 1 . 1 23'»5 ) : 



IN ALL CASES WHERE IT IS NOT ABSOLUTELY CLEAR THAT THE CONDI- ^ 
TIONS FOR OPTION 2 A^^E FULLY MET, AS WELL AS IN ALL CASES WHERE 
GROUP-SUM COHPARISONS ARE DESIRED, OPTION 1 SHOULD BE SELECTED. 
IN THI'S OPTION, THE PROGRAM WILL ADD ALL OF THE ENTERED FILES 
INTO AN OVERALL TOTAL DISTRIBUTION AND ASK : FOR THE CODE NAME DESIRED. 
FOR THE OVERALL DISTRIBUTION. (AT THIS POINT, THE COMPUTER WILL ^ 
ALSO WRITE OUT THE CODE NAMES OF ALL OF THE FILES INCLUDED IN THAT 
TOTAL.) 

*WHAT 6-SPACE CODE SHOULD WE USE FOR 
*THE OVERALL GROUP TOTAL? 

* SUM of: (FILE COOE NAMES, UP TO 8 FILES) 

^ - 293 - 



2Ss 



= COMPUTER WRITTEN 



.•GS810 INSTR: P(6) OF (8) 
("OVERALL" SUM, DATA) 



IF THE 
COMPUTi 



JVERALL TOTAL" OPTION HAS BEEN SELECTED, 
asks: 




E) data: 
AS TYPED TABLE? 
INTO COMPUTER FILE? 
BOTH? OR 

NO WRITE-CUT WANTEO. (ANS 1-4) 



*WRITE 0 

* (1) 

* (2) 

* ( 3 ) 

* (i) 

IF "1#;1J|^STS CODE NAME OF DISTRIBUTION, 

OF THE JNOIVIDUAL GROUPS INCLUDED IN THE 



TYPES OUT 
VALUES. - 



OUT THE CODE NAMES 
OISTRIBUTION, ANO 

THE COMPLETE 8X10 MATRIX, INCLUDING COLUMN.r AND ROW- 



TYPES 
TOTAL 



*IF "2", THE FILE CONTAINING THE TOTAL DISTRIBUTION IS WRITTEN INTO 
A. COMPUTER STORAGE AREA. AT THE END OF THE PROGRAM RUN THE COM- 
PUTER WILL LIST THOSE FILES WHICH HAVE BEEN WRITTEN INTO COMPUTER 
STORAGE DURING THE CURRENT RUN. THESE FILES MAY THEN BE SAVED 
OR DELETEO AS THE OPERATOR SO DESIRES. 

IF "3", the' COMPUTER WILL BOTH TYPE OUT THE COMPLETE FILE AND 
ENTER IT INTO STORAGE. UNDER "2" AND "3", THE COMPUTER TYPES : 



♦COMPUTER FILE EN 



IF OPTION "A" IS. 
TO THE NEXT S 




,^.^^^CTZD, THE COMPUTER SKIPS THE ABO^E AND GOES 
;T£P I^CTHE "OVERALL TOTAL" OPTION AND WRITES-: 

^COMPUTED DATift FOR TOTAL OISTRIBUTION (NAME) 
* SUM of: (fjAMES OF FILES MAKING UP TOTAL) 

R= (COEF. OF CORR.) 



EQUATI 



^: 



LOG Y = (A) + (B)LOG X 



* BASE N = (NO.) BASE VARIANCE = (VALUE) 

* 

*IS A 35-YEAR CAREER PROJECTION DESIRED? (Y OR N) 

IF "Y" IS GIVEN, PROGRAM ITERATES THE ABOVE GRADE/TIME 
EQUATION 35 TIMES, BEGINNING WITH THE BASE POINT USEO IN 
FIRST COLUMN. IF "N", PROGRAM PROCEEDS TO NEXT STEP. 

UNDER THE "STD PATTERN" OPTION MENTIONED ON P^GE 3, SOME OF THE 
VARIABLES OISCUSSED ABOVE CAN BE SET AT THE START Of THE RUN 
AND THE COMPUTER WILL NOT PRINT THE QUESTIONS PERjTAINING TO THOSE 
VARIABLES. THE VARIABLES INVOLVED ARE : ■ t 

•(1) COL-VALUES — WILL BE SET TO OPTION "I" 
(2) ROW-VALUES — OPERATOR. IS 

GS-5/15 AND 
SUBGROUP TESTS -- WILL BE 
COMPUTED DATA -- WILU NOT 
35- YEAR PROJECTION -- WILL 



(3) 
(A) 
(5) 

QUESTIONS PERTAINING TO OTHER 



STILL GIVEN A CHOICE BETWEEN 
GS-1/8 

SET TO OPTION "1" 
BE PRINTEO OUT 
BE SET TO "N" 



VARIABLES 
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GS810 INSTR: P(7) OF (8) 
(TEST QUESTIONS) 



NEXT? IF EITHER OPTION 1 • OR •2" UNDER "SUBGROUP TESTS" HAS BEEN 
CHOSEN, THE COMf^UTER ASKS * 

^WRITE OUT TEST QUESTIONS? ( Y OR N ) . 

IF "Y", THE COMPUTER WRITES OUT THE QUESTIONS ANSWERED BY THE 
STATISTICAL TESTS CONTAINED IN THE PROGRAM : 

♦SUBGROUPS WERE TESTED FOR THE FOLLOWING QUESTION(S): 

* * . • 

* QUESTION (1 ): 

* ■ ■ % 

* IS THE GRADE-AUVANCEMENF CURVE OF EACH WORKFORCE SUBGROUP 
SIGNIFICANTLY DIFFERENT (HIGHER, LOWER) THAN THE NORM FOR 

^THEIR OVERALL OCCUPATIONAL GROUP * (NAME ) ? > 



QUESTIpN (2) 



* 
* 
* 
* 
* 

* > IS THE SPREAD OF SUBGROUP EMPLOYEE GRADE (SKILL) LEVELS 

* ABOVE AND BELDW THIS NORMAL GRADE-ADVANCEMENT CURVE 
*^ SIGNIFICANTLY DIFFERENT FROM THE (NAME) NORM? 

* QUESTION (3): 
* 

• * ' DOES THE OVERALL GKAUt/flME PATTERN RESULTING FROM- 
* 

* (A) THE SUBGROUP GRADE- ADVANCEMENT CURVE;, AND 

* (jX THE SUBGROUP GRADE (SKILL) SPREAD 

* VV^f'GNIF.IOANfL:r DIFFER FROM THAT OF THE (NAME) NORM? 

* THE COMPUTER THEN PRINTS OUT THE FOLLOWING PARAGRAPH EXPLAINING* 
TH^ STATISTICAL SIGNIFICANCE DF TEST RESULTS : 

♦TEST DIFFERENCES ARE CONSIDERED STATISTICALLY SIGNIFICANT WHEN THE 
♦iPROBABILITY THAT THE 1 R\DCCURRENCE COULD BE ATTRIBUTED TO CHANCE 
*IS 0.05 (1 CHANCE IN 2aS|^ OR LESS. 

IF "N», THE COMPUTER GOES DIRECTLY TO THE WRITE OUT OF THE TABLE 
WHICH SUMMARIZES THE RESULTS DF THE STATISTICAL TESTS (I.E., 
ANSWERS THE ABOVE QUESTIONS) MADE IN THE PROGRAM. (THIS TABLE 
IS PRINTED OUT IN ALL/CASES WHERE SUBGROUP DIFFERENCES A^E TESTED.) 



.um .1 



THE THREE MAIN COLUHN HEADINGS OF THIS TABLE REFER TO THE THREE 
QUESTIONS TESTED AS FOLLOWS : 

QUESTION S COLUMN HEADING 



( 1 ) ADVANCEMENT CURVES 

(2) GRD (SKILL) DISIRIB 

(3) OVERALL G/T PAtYERN 
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GS810 INSTR: P(8) DF (8) 
*= COMPUTER WRITTEN ( DATA WR I TE-OUT ; AUTO RUN) 

AFTER PRINTING- OUT THE "SUMMARY OF TEST RESULTS"/ THE COMPUTER ASKS: 

*WRtTE OUT DETAILED TEST DATA? (Y OR N) 

IF "Y", THE MACHINE PRINTS A. TABLH CONTAINING THE STATISTICAL PATA 
WHICH WAS USED IN THE STATISTICAL TESTS IN 'THE PROGRAM. DATA ARE 
PRINTED OUT FOR BOTH THE OVERALL DISTRI BUTION ■ ^OR OUTSIDE GROUP) . 
AND EACH OF THE PREVIOUSLY-ENTERED SUBGROUPS. 

IF "N", (OR, IF*'"Y", AFTER THE PRINTJ-NG OUT OF THE "SUMMARY OF 
THE STATISTICAL DATA"), THE COMPUTER WRITES : 

*NEXT: FILE WRITE-OUT: 

* { 
♦ENTER NO. OF FILE(l-8). OR • 
*9 (NO FILE WANTED) 

THIS OPTION ALLOWS THE OPERATOR TO WRITE OUT ANY FILE HE MAY 
WISH TD SEE FOR MORE' DETAILED STUDY. "FILE(l-8)" REFERS TO THE 
FILES ENTERED AT THE BEGINNING OF THE PROGRAM IN THE ORDER THEY. 
WERE ENTERED. (FILES ARE ALSO LISTED BY. NUMBER IN THE "SUMMARY 
OF STATISICAL DATA" TABLE.), AN ENTRY OF "1" CAUSES THE FIRST FILE 
TO BE PRINTED, "2" THE SECOND, AND SO ON. THESE NUMBERS MAY BE 
ENTERED IN ANY ORDER. , 

IF A "9" IS ENTERED, NO FURTHER FILES ARE WRITTEN OUT-AND THE COM- 
PUTER THEN WRITES-: 

* RUN COMPLETED. NEXT: , . . on o ^ 
*(1) AGAIN (AUTO)? (2) AGAIN (STEP BY STEP)? OR (3) QUIT. (1,2 OR 3) 

IF "1" IS GIVEN, THE PROGRAM RUNS NEW FILES THROUGH EXACTLY' THE 
SAME PROGRAM-OPTION STEPS WHICH THE OPERATOR CHOSE IN THE PREVIOUS 
RUN-THROUGH WITHOUT ANY OPERATOR PARTICIPATION OTHfR THAN EN- 
TERING THE NEW FILE CODES. IF "2" IS GIVEN, THE PROGRAM AGAIN 
STEPS THROUGH EACH OF TH£ ABOVE STEPS (EXCEPT THAT OPTION IS. 
GIVEN TjO RETAIN THE ABOVE- SELECTED COLUMN- AND GRADE- PATTERNS). 
IF "3" IS GIVEN, THE COMPUTER LISTS THE NAMES OF FILES WHICH HAVE 
BEEN PLACED IN COMPUTER STORAGE AS FOLLOWS .: 

*(N0.) FILES ENTERED DURING RUN; . . • 

*"SAVE" THOSE TO BE KEPT: 

* (NAMEl ) 

* (NAME2)- 

. . . • '''' , *• - , -f , ' ' ^ 

*rhE OPERATOR MAY THEN SAVE ANY FILES HE WISHES TO USE AGAIN. AFTER 
- THIS LISTING, THE RUN IS TERMINATED. 

note; FILES STORED DURING A PROGRAM RUN CANNOT BE ACCESSED DURING 

THAT RUN. TO USE THESE FILES, OPERATOR MUST TERMINATE THE CURRENT 
PROGRAM RUN AND BEGIN A NEW RUN. . 
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GS810 



.HIS PROGRAM ANALYZES 10-YEAR GRADE TRENDS IN 1-8 .GROUPS 
OF THREE OR r'DRE EMPLOYEES. COMPUTES lO-YE^R GRADE/TICIE 
CURVE AND VARIANCE (OR ACCEPTS THE GRADE/TIME EQUATION ' 
AND VARIANCE OF AN OUT-MDE GROUP) ANB TESTS THE GRADE 
TREND OF EACH ENTEREO. GROUP FOR STATISTICALLY SIGNIFICANT 
DIFFERENCES FBOM THE TREND OF THE OVERALL (OR OUTSTDE) • 

■GROUP. ENTERED GROUPS MUST EACH HAVE A POPULATION OF 
THREE OR MORE FOR VALID RESULTS TO BE OBJAINED. , (FOR AN 
OUTSIDE GROUP'S EQUATION AND VARIANCE PO BE USED.' AS A BASIS 
FOR ANALYZING GROUPS ENTERED *^IN THlsyROGRAM, THE OATA FOR 
THE OUTSIDE GRO^P MUST HAVE INCLUDElD AUL -- REPEiAT, ALL -- 
OF THE INDIVIDUAL GROUP DATA ENTEREO HERE. Tli&SE GROUPS, 
JHA-T IS^. MOST 6E COMPONENTS 'OF THE OUTSIDE GROUP.) 
•. . * ^ 

.DATA ARE ENTERED IN.TO THE PRSGRAM FROM PREVIOUSLY-STORED 
FILES ACCESSED VIA A 6-SBACE .COOE (EXPLAINED IN THE OPER- 
ATION MANUAL). FILES CONSIST, OF 8 CONSEcO T I VE . T YPED L INE* 
OF l^'S^TJ^If S EACH, SEPARATED BY •COMMAS, WITH EACH ENTRY 
MADE UP OF I-'* INTEGERS (OR ZEROES -- NO BLANKS)^. " 

REQUIRED subprograms: TaLSlO, VLL810, FTEST, ANOXP • 



I 



■ Gsaio v.' .. ■ : . . page "i • ' 

■ ■■OinENSlON lAR^ 

1 ' X(iO)l.Y(8),ITR(8,10),SIG(3,8),P(3), ' ' 

I NXX<36),SUMN(8),VA«GP(8.),NOTEST(8),PATDIF(8), • 

•<r N(8),R(8),A(8)*B(8),VAROWN(8),DY(8), 
.1 DIFF(8),R'ANGE(8),NDASH(36),NAMFIL(20) 
. * LOGICAL IMP. INP^IP,IQ,N0TEST..1STD 

doublIe precision diff.patdif.sig.narray.karray.idbase 
data^inp/!trueI//iq/.false./,iv/-n-/,imp/ 

. . 1 (NXX(I),I = 1 ,36)/36*:xX-/;(NDASH(I),I = l,36)/36* -- /..NSAVED/O-/ 

1 .MT/'Y'/,ISTD/.FALSE./,(DY(I)^=1 ,8)/5.2S,5.75,6.5,7.5. • 

1 8.25,8.75,9.25,9.75/ 
120 WRITE (5,130) _ . , 

130 FORMAT (//1X-,'THIS PROGRAM ANALYZES GRADE/TIME TRENDS IN. 1.-8 ) 

,WRITE (5, 150) ^ 
:i5.0 FORMAT (IX, 'GROUPS DF 3 OR MORE EACH; USES 8-GRAOE X lO-Y^AR 

1. /IX, 'DATA ARRAYS.') 

■ ■ WR I TE ( 5 21 0 ) 

~ 210 FORMAT (/IX, 'ENTER FILES VIA 6-SPACE CODE (SEE I.NSTR).') 

220 WRITE (5f2*30) 

IVS. FORMAT (///IX, 'HOW MANY FILES DD YOU WI,SH TO ANALYZE?' 
^ 1 ' (ANS 1-8)'!. . 

READ (5.260)NF 
260 FORMAT (12) 

280 FORMAT (IX. 'PLEASE FNTER CODE NAME OF EACH FILE:') 

DO 340 1=1 ,NF . ) 

WRITE (5,310)1 . / , . , ^ 

310 FORMAT (rX,I2,'=') \ - ' ' 

" READ (5,330)NARRAY( I ) \ 
330 FORMAT .(A6) 

340 CONTINUE 

DO 410 I=1,NF 
370* KARRAY=NARRAY( I ) 

CALL IFILE(20,KARRAY) ^^ . ox 

- . ' READ (20,*)<(IARRAY(I,J,K),J=l,lO),K=l,8) 
»0D ' FORMAT ( 1 0 1 4 , 7 ( / , I 0 1 4 ) ) 

MO CONTINUE -. . ' / 

WRITE (5 430.) • ■ " . _ 

■ »30 FORMAT (//I X,'DATA TAKEN FROM FILES. NEXT..*'); 

IF(INP)WRITE(5,450) ■ 
50 F0RMAT(/1X,'"STD PATTERN" D£SIRED? (Y OR N) ) ' , 

IFCINP)READ(5,56q).IE . ^ >A 

IF(.IE. EQ.'N')1STD=. FALSE. jIFdE.EQ. N ) GO 10 530 ♦ 

IF(TNP)-WRITE(5,490) . ^ 

1*90 FORMA't(/lX,'(l) 5/15 DR ( 2 ) 1/8? (1 OR 2). ) 

IF(I'NP)READ(5,260)IR ' ^ • ' 

• IF(IR^EQ.l)NYPAT=l;IF(IR.EQ.2)NYPAT=2 . 

NXPAT=1 ;LOP=l ;MDR='N';.ISTD=-TRUE. , . 

530 ■■ IF ((INP).AND.(IMP).ANO...W(iT.ISTD) WRITE .<5,540.) • ^ - 

540 ■ FORMAT (/IX, 'SAME GRADE (WT )• AND GOL-VALUES PATTERN? (Y-OR N) ) 

IF((INP). AND. tIMP). AND. ..NOY-ISTD) READ (5,560)IV .. • 

■ 'S- ■ : 300 - . *: ik 
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560 

600 
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630 

650 



760 
770 



810 
820 
830 
8A0 



870 



960 



962 
9'63 
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FORMAT (A2) . 

IF((lNP).ANb.(IMP).AND.IV.EQ.'Y') GO TO 1250 
IF(((INP).OR.(ISTD)).AND.IV.EQ.'N') GO TO 600 
GO TO 1>30. ' 
IF(.N0T.ISTD)WRITE (5,610) 

FORMAT (/6X,'(A) SELECT COLUMN-VALUE PATTERN WANTED: 
IF(.NOT.ISTD)WRITE (5,630) 
FORMAT (10X, ^(USE COL M I DPOINTS 
,IF(.NOT.ISTD)WRITE (5,65.0) 
FORMAT (/10X,'(1) 0.5,1.5, ETC.' 
1 /10X,'(2) 1 . , 2. , ETC." ' 

1 /10X,'(3) OTHER (1 ,2 OR 3)') • . 

IF(.NOT.MSTD)READ ( 5 , 260 ) . NXPAT 
IF(NXPAT.EQ.3) GO TO 760 " 
IF (NXPAT.EQ. 1 )EI=-0.5 

IF(NXBATiEQ.2)EI=0.0 ■ . • ' 

DO 7^0 ^=1,10 ' 

EX=EI+I ^ '^^ 

X(I) = EX J^, 

GO TO 830 

WRITE (5,770) 

FORMAT ('6X,'ENTFR nE'^lRFO v/M HE FOR MIDPOINT OF COL.: ) 
OQ 820 1=1,10 
WRITE ('5, 10^0)1 
READ(5,810)X( I) 
FORMAT (F6>3) 

(iONTINUE '■ ^ . ■ 

rF( .NOT. ISTD>WRI TE r5,fi'»0)- 

F0RMAT(/6X,'CB) SELECT GRADE PROGRESSION' ^ 
1 /lOX. '(ROW-VALUE) PATTrRN WANTFfl-:-0 
IFC.NOT. ISTD)WRI TE(S,^70) 
FORMAT(/10Xi,''( I') GS-S/t^' - 

I /10X,'(2) GS-l/fl' . , ; 

1 /10X,'(3) OTHER (1 ,2 OR 3) ) 
IF(.NOt. ISTD)READ(5,2 60)NYP/\T 
IF(NYPAT.EQ. 3) GO TO 990 
IF(NYPAT.EQ. 1 )EI=7.S , , 

IF(NYPAT.EQ.2)GG TO 962 . - 

DO 960 1=1,8 - 
EY=EI+I V -> 

•Y(I)=EY 

IF(1STO)GO TO 1250 • ' . 

GO -TO 1080 -r- 

DO 963 1 = 1 ,8 ■"■'■^ 

Yd )=OY(I ) 

K(ISTD) GO TO 1250 

fiO TO 1080 ' - 

1 000) - ' ' \ 

T(/6X,'ErJTER MIDPOINT ( E. G. , GRD- " 1 " = (.1 , - 1 . 99 ) = " 1 . 5" ) 

•'1 ) •^flJ''POR\ROW:') ^ ' . ' 

4*= 1,8 

'(;^, 10/10 j I 

,T2) 

flO)Y(I ) 
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TO 1250 
IF (NYPAT.NE. 1 X GO TO 
WRITE (5,1090) 

FORMAT (/IX, 'PATTERN APPLIED: ) 

WRITE (5, 1120) . „.>.^o 

FORMAT (/1X,'(S0, THAT "2-GRADE- INTERVAL-SER lES" GRADES 
i 'PROGRESS AT DHE SAME') . 
WRITE (5,1150) 
FORMAT ( 1X,'"1-GRADE/1-STEP' 
1 ' AS THEY DO ABOVE GS-1 1 
WRITE (5, 1 180) 

Format (-IX, 'grades were vifEIGHTEO: GS-5="8' 
1 ' GS-9=''10". )') 

WRITE (5,1210) . 
FORMAT (/1X,'( 0.50 WAS ADDED TO EACH GRADE.-VALUE AS THE MIO 

1 'POINT OF^ THE') i 
WRITE (5,1240) / 

FORMAT (IX, 'TOTAL RANGE OF iOBS IN TH$T GRADE.) ) ' 
IF('(INP).AND..NOT. ISTOWRITE^ (5,1260) 
F0RMAT(//1"X,'TEST SUBGROUPS AGAINST :' ,/6X 
1 'SUM?',/6X,'(2) AN OUTSIDE GROUP?') 
IF ((INP) .AND. .NOT. ISTD)WRIT.E(5, 1290) 
FORMAT(6X,*(3) NO fESTS DES I RED. V (^ , 2 OR 3 ) ) 
IF ((.INP) .AND. .NOT. I STD) READ (5,260 )LOP 
IF(L0P.EQ.3)MT='N' 
IF(MT.EQ.'N')1G0 TO 1520 
IF (LOP.EQ^I.I^MT.EQ.'H:^) GO TO 1520 

F0^MAT^(./1X^'WHAt'6-SPACE code name should we USE FOR THE" 
1 7IX, 'OUTSIDE (BASP) GROUP?') 
READ(5. 1 380)IDBASE 
FORMAT (A6) 

WRITE (5, 1400) IDBASE . 
FORMAT( IX. 'PLEASE ENTER THE FOLLOWING FOR BASE GROUP 
1 ' ',A6,':') 
WRITE (5, 1430) IDBASE • 
FORMAT (/6X,A6,' GRADE/ TIME CURVE 
1 75X, '(E.G., J. 12345, .123456): 
READ (5,*)YIN,SL 
FORMAT (2F9.6) 
WRITE (5*1480)ID3ASE 
FORMA-T (6X.,A6,' N, VARIANCE (E.G. 
READ (5,*) NBAS.VARBAS 
FORMAT -.H5,F9.6) 

GO TQ 2280V ■ . .-, , 

WRITE(S,1530) . ^ 

FORMAT (/1X,'WHAT 6-SPAtE COOE SHOULD WE USE 
1 /1X','THE OVERALL GROU^^^TUTAL?' ) 
"WRITE (S,1560)(NARRAY(I ),I=1 ,Nf ) ^. . > 

FORMAT (4X, 'SUM of:', 8( IX,' ',A6)) v 

READ (5 , 330) I DBASE • , , 

IF (INP) WRITE(5, 1 590) IDBASE 
FORMAT (/IX. 'WRITE OUT ',A6,' DATA: 

1 /6X.-(1) AS TYPED TABLE? -,/6X,'(2) INTO COMPUTER FILE?') 
- IF( INP^5?ftlTE(5, 1628) 
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FORMAT(6X.*(.3) BOTH? OR- 

I /6X,-(4)' NO WRITE-OUT WANTED. (ANS 1-4) ) 
IF(INP) READ (5,260)LP 
00 1670 1=1 ,8 

DO 1670 J=l , 10 . . 

WORK( t ,J)=0.0 

00 1740 1=1 ,NF , 

DO 1740 JJ=1 ,10 J . .' ^ 

00 1740 11=1 ,8 ^ V 

WA=IARRAY(I .JJ.I I ) * 1^ 

WORK(II,JJ)=WORK( II ,JJ)+WA f 
ITR( 1 1 ,JJ)=WORK( I I , JJ) , 

CONTINUE , . 

CALL RLL810(X, Y, WORK, YIN, SL, NBAS, lER) , 

NBASDF=NBAS-1 " 

CALL VLL810(YIN;'SL,X,Y,WORK,VARBAS,VARTOT,RBAS) 
IF(LP.EQ.4)G0 TO 2020 
IF(LP.EQ.2')G0 TO 1920 
. WRITE(5,1810)IDBASE 
F0R^1AT(/// IX, 'CUMULATED DISTRIBUTION '^AS, i ) , 
WRITE<5, 1560)(NARRAY(K) ,K=1 ,NF) 
WRITE (5,1840) (X( I ) , 1=1 . 1 0 ) 
FOR^IAT (/1X,8X, 10F6.?) 
WRITE (5, 1860) 
F0RMAT(2X,'GRD(WT )') 
I 1 = 9 

DCT 1900 IJ=1.8 
I^II-IJ 

WR\ITE(5,18.90)Y(I),(IT/^(I 
F0RMAT(2X.F';.?.7y. 101^;) 

CONTINUE 

IFflLP.EQ. 1 ) GO TO 20?0 

iti1mes=itimes + i 
if(\itimes.gt.2'«) itimes=21 
caliaof ile ( i t i me s , i dbase ) 

WRITnaTIME5,400)((ITR(I,J).J=1.10),I = l,8) 

IF CALlNSAVE OPTION IS DESIRED, REMOVE * FROM NEXT 3. LINES 
CALL SAVE( ITIMES, ISTAT) 
IF (ISTAT. EQ..O) WRITE (5,1933 ) 
1933 FORMAT (//IX, 'FILE SAVED') 
' WRITE(5, 1970) 

F0RMAT(//1X, 'COMPUTER FILE ENTERED. ) , 
'iF(ISTD)GO TO 2280 

WRITE (5,2000)ID8ASE - 

FORMAT (///IX, 'COMPUTED DATA FOR TOTAL DISTRIBUTION. , 

1 A6,':') ^ 
. WRITE (5,1560)(NARR/JY(I),I=1,NF. 

WRITE <5,2080)YIN,SL,RBAS 

FORMAT (/6X ,'EQUAT ION-" LOG Y 
. 1 ' , R='*,F9.6). 
' WRITE (5,2 1 10)N§AS,VARBAS 

FORMAT (/6X,'BASE N =',I4,' 

IF((INP). AND.. NOT. ISTD)WRITE(5, 2130) 

F0RMAT(//lX,'3i-YEAR (CAREER) PROJECTION? (Y OR N) ) 
' IF(( INP).AND..N0i;.ISTD)READ(5,560)M0R 
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IF (MOR.EQ.'N-) GO TO 2280 

WRITE (5,2170) JDBASE 
2170 FORMAT (//IX.'SS-YEAR CAREER PROJECTJON OF 

WRITE (5,1560)(NARRAY(I) ,1=1 ,NF) 

WRITE (5,2200) , ^ ^ 

2200 FORMAT (/IX,' YEAR GRADE ts^ ) 

DO 2270 1=1,35 

XI=I-1 

•xi=xi+x(i ) _ ■ 

PY=10.**(YIN+SL*ALOG10(XI)) 
. « WRITE (5,2260)XI ,PY ^ 
2260 FORMAT ( 1 X , F5 . 2 , F9. ) 
2270 CONTINUE 

2280 IF (MT.EQ.'N') GO TO 3970 

DO 2960 K=l ,NF 

i)0 2310 L=l ,3 

2310 P(L)=.99 

,NOTEST(K)=. FALSE. 

DIFF(K)=' 

PATDIFXK)=' 

RANGE(K)=' 

SUMN(K)=0.0 

DO 2410 J=l , 10 

DO 2410 1=1 ,8 

WORK(I,J) = IARRA.Y(K,J,I) 

SUMN(K)=SUMN(K)*WORK(I , J) 

2410 CONTINUE • 

IF(SUMN(K,).LT.3.) NOTEST (K )= . TRUE . 

IF(NOTEST(K))- GO TO 2880 ;^ 

CALL VLL810(YIN.«:| . v . Y . WORK . VAR2 , VTOT , RR ) 

F=VAR2/VARBAS 

VARGP(K)=VAR? 

NGMS-SUMN(<)- 1 . 

CALL FTEST (NGMS . NRA-^nr . r . PPAT ) 
P(1)=PPAT 

PATDIF(K) = '" 

IF(PPAT.GT.0.05) GO. TO 2530 
PATDIF(K)=' DIFF.' ^ 
2530 CALL RLL 8 1 0 ( X , Y , WORK , AA , 8B ^NSG , I ER ) 
- • IF(IER.EQ.O) NOTEST(K)=.TRUE. 
ir(IER.EQ.O) PATDIF(K)=' 
IF(IER.EQ.O) P(l )=.99 
IF (lER.EQ.O) GO TO 28-80 
CALL. VLL810(AA, BB,X,Y, WORK, VSG,VARS„RS) 

N(K)=NSG . . ■ 

R(K)=RS >^ 
A(K)=AA . 

B(K)=BB . 
VAROWN(K) = VSG / ^ 

IF(VAROWN(K).EQ.O.O) NOTEST (K )= . TRUE . 

; ,: IF (NOTEST(K)) GO- JO 2880 

FF=VAR2/VSG •- , • 

NSGDF=NSG-1 ^ / 

CALL FTEST (NSGDF , NSGDF„'FF ,PP ) 
DIFF(K)='. ' . 
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P(2)=PP . 
IF(PP.GT,0.05) G 
0rFF(K)=' DIFF 
IPCYIN.GT.AA.AND.SL 
IF(AA.GT.YIN.AND.BB 
FO=;VARBAS/VSG~ 
.RANGE(K)=' ' 
, IF (FD.LE. 1.) GO TO 2830 
CALL FTEST(NBASDF,NSGDF,FD,PD) 
P(3)-P0 

IF(PO.(^0.05) GO TO 2880 
RANGE(K)='': ^SMALLER' 
GO TO 2880 ^ 

"fo=i./fd • ■ 

call ftest(nsgdf,nbasdf,fd,po) 

P(3)=P0 

IF(PO.GT.0.05) GO TO 2880 
RANGE (K)'=' GREATER' • , ^ 
DO 2960 1=1,3 
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.BE) DIFF(K)= 'LOWER " 
S.L) DIf^F(K)='HlGHER' 



siG(j,K)=' "^m.^ 

I F ( . NOT. NOTE ST (K ) ) S I G ( I , K ) = ' 'M,^- 
IF(P(I).GT.0.05) GO TO 2960 

IF(P(I).LE.0.05)SrG(I,K>='<=.05 ' ^ 
IF(P(I ).LE.0.01)SIG(I ,K)='<=.01 7 
IF(P(I ) .LE..001 )SIG( I ,K)='<=.001' 
IF(P( I ) .LE..0001 )SIG( I ,K) = '<.000r 

CONTINUE 
WRITE(5,298Q) (NXX( I) . 1 = 1 .3S) 
FORMAT(/VlX,3SA2) 
IF(INP) WRITE(5,3O00) 

FORMAT (/IxrWRITE OUT TES T. QUEST IONS? (Y OR N.) ) 
IF(INP) READ (5,560) IWQ 
IF( I WQ.EQ.'N') GO TO 3^*50 
WRITE(5,30'*0) 

F0RMAT(//1X, 'SUBGROUPS WERE TESTED FOR THE FOLLOW 

1 'ING QUESTI0N(S):'",//6X, 'QUESTION (1):') 

WRITE(5,3070) , . 

F0RMAr(/9X,'I S THE GRADE- ADVANCEMENT CURVE DF EACH - 

1 'WORKFOI^CE SUBGROUP') 

WRITE (5, 3 100) ■ ■ 

F0RMAT(9)^'SIGNIFICANTLY DIFFERENT (HIGHER, LOWER) THAN 
1 ' THE NORM FDR') ^ Jt 
WRITE(5J3130)IDBASE 

FORMAT (9X, 'THEIR OVERALL OCCUPATIONAL GROUP X ,"A6, )? ) , 
WRITE(5,3150) . ^ 
T0RMAT(/6X, 'QUESTION' (2):') 
WRITE (5, 3 170) 

F0RMAT(/9X,'IS THE SPREAD OF SUBGROOT EMPLOYEE 
1 ' GRADE (SKILL) LEVELS') . 
WRI TE(5,3 2pa) ^" 

FORMAT (9X, 'ABOVE AND BELDW THIS NORMAL GRAOE-ADVANCEMENT 
1 " CURVE') » 

WRITE (5,3230)IDBASE , 
F0RMAT(9X, 'SIGNIFICANTLY DIFFERENT FROM THE ',A6, NORM? ) 
WRtTE( 5.3250) _ _ 



280 
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3250 F0RMAT(/6X, 'QUESTION (3):') 

3260 WRITE(5,3270) , • 

3270 • F0RMAT(/9X, 'DOES THE OVERALL GRADE/TIME PATTERN ' 

.1 'RESULTING from:') - . . 

WRITE(5,3300) . . n - ' 

33.00 . F0RMAT(/12X,'(A) TH£^UBGR0UP GRADE-ADVANCEMENT CURVE;' 

1 ' AND') 

WRITE(5,3330) 

3330 F0RMAT(12X,'(B) THE SUBGROUP GRADE (SKILL) SPREAD') 

WRITE(5,3350)IDBASE 
3350 F0RMAT(/9X, 'SIGNIFICANTLY DIFFER FROM THAT OF THE ' 

1 ,A6,' NORtI?') 

WRITE(5,33ao) ^ 
3380 . FORMAT (/IX.'TESJ DIFFERENCE^ ARE CONSIDERED STATISTICALLY' 

1 ^ SIGNIFICANT WHEN T-HE') . \ . 

3400 WRITE(5,3410) 

3410 FORMATdX, 'PROBABILITY THAT THEaR DCCURRENCE COULD yBE ' 
1 'ATTRIBUTED , TO CHANCE') / 
WRITE(5,3440) ' 

3440 FORMATdX.'IS 0.05 (1 CHANCE IN 20) OR LESS^') 

3450 WRITE(5.3460) " ( 

3460 F0RMAT(//1X,'KEY:',/4X,'" " = NO TEST (N<3)', 

1 /4X,'" " = NOT SIG.') 

' WRITE(5,3490)(NDASH.(I),l'=IJ|n),(NDASH(I),I = l,11) 

3490 F0RMAT(//1X. I1A2, IX, 'SUmA^ OF TEST RESULTS', IX,'-', nA2) , . 

WRITE(5,3510) • , = ^ ^ 

3510 F0RMAT(/8X, 'ADVANCEMENT CURVES GRD (SKILL) DISTRIB 
1 ' OVERALL G/T PATTERN') 

WRITE(5,3 550)(NnASH(I),I-),9).(NDASH(i),I=l,10),(NDASH(I), 

t .1 1 = 1.10) ; - 

35SO F0RMAT(8X,9A2,'- ' .2X,10A2,?X'.,I0A2) 
WRITE(5,3570) 

3570 FORMATdX, 'FILE SUBGROUP P. OIFF.S SUBGROUP P. DIFF.S' 
1 ' ^BGROUP P.- DIFF.S") 
WRITE(5,3600) 

3^600 . FORMAT (IX, 'CODE CURVE IS' ATTR CHCE SPRD IS ATTR CHCE' 
1 ' PTTRN IS ATTR CHCE') 

WRITE(5,3630)(NDASH(I),I = 1 ,9). ( NDASH( I ) ,• 1 = 1 , 1 0 ) , ( NDASH( I ) , 1 = 1 , 1 0 ). 
3630 FORMATdX,' ' . 9A2 , '-' . 2X , I 0A2 , 2X , 1 0A2 , / ) 

DO 3680 K=l ,NF A ' *v 

' . WRITE :(5,3670) NARRAY(Kf),DIFF(K),SIG(2,K) , \ 

i : RANGE'(K),SIG(3.K),PA(TDIF(K),SI=ed,K) • . 
3670 FORMATdX, A6,.2X,A6,5X,A6,3X,A8,5X,A6,4X,A6,6X,A6) 
3680 CONTINUE ' , ^ . 

WRITE (5,3700)(NDASH(I),I=1,35)« 
3700 F0RMAT(/1X,35A2) ' •• 

IF(INP) WRITE( 5. 3720), 
3 720 FORMAT (//IX, 'WRITE OUT DETAILED TEST DATA? (Y OR N)') 

IF(INP) READ (5.560)IWD 

IF(IWD.EQ.'N')GO TO 3970 

WRITE(5,3760)(NDASH(I).I=1,8),IDBASE,(NDASH(I),I=1 ,9) ^ 
3760 F0RMAT(///1X,8A2,' SUMMARY OF '.A6,' STATISTICAL DATA ',9A2) 

WRITE (5, 3 780) , 
^rmr FbRMAT(/,27X, 'GRADE/TIME EQUAT ION' , lOX, 'VAR I Al^JCE FROM:') »■ . 
• ■ WRITE(5,3800)(NDASH(I),I=1,12),(NDASH(I),I=I,8) 
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*F0R«AT(1X,'GP'. FILr,5X,-GP^ 5X,1 2(A2 ),'--, 5X,'-*, 8 (A2 ) ) 
.WRITEt5,3820)- ""N"^ ' . 

FORMAT (IX, 'NO. CODE POP. ,, (A) MB), (R) 

1 " GP. EQ. OWN EQ.'J 

WRITE (5, 3850) (NDASH( I ) , 1 = 1 , 9 )., (NQASH( I ) , I* I i 1 2 ) , (NOASH( I ) , 1 = I ,8) 
FOR«'AT( 1X,9A2,5X, V2A2 ,'-*,5X,'-',8A2,/) 
WRITE (5, 3870) I DBASE, NBAS, Y IN, SL,R.BAS,VARBAS 
F0RMATOX,A6,iiX, I6,FJ2.5,F9.5,F9.5,6X,'(NA)',F12.5,/J 

^ 00 -3950 K=l ,NF • . ESk 

. IF(.NOT.NOTEST(K))WRITE(5,39TD)K,NARRAY(K),N(K),A(K), 
r B(K),R(K),V7\RGP(K)-;-VAR0WN(K) 

FORMAT (lX,I2,':',2XiA6,lX;l6,Fl2.5,2F9.5,Fj2.5,F10.5) 
IF(NOTEST(K)) N(K)=SUMN(K) " 
^IF(NOTESTXK) )WRITE(5,39'»0)K,NARRAY(K) ,N(K) 
FORMAT( 1X.I2,':',2X,A6, IX,I6) 
CONTINUE 

WRITE(5, 3700) (NQASKjW) ,1 = 1 ,3?^ 
. IF(LP.EQ. I .0R.LP.PQ.'*.0R.L0P.EQ.2) GQTO 'iOlO . 

NSAVE0=NSAVE0+1 ' \r 
-NAMFJL (NSAVEO ) = I DBASE 

IF(MT.EQ.'N') GO TO ^i 1 50 

WRITE(5,'»030)r 

F0RMAT(//IX,'NEXT: FILE WRITE-OUT-', //IX, 'ENTER NO. OF FILE 

I '(1-8) OR', /IX, '9 (NO FILE WANTED)') 
REA0(5,260)Nf^P ' ' 
IF (NFP.EQ.9) GO TO ^i 1 50 ^ 

I ^RIT-E(5,'*080)NARRAY(NFP) 
F0RMAT<///IX,A6,' FILE:') 
WRITE(5, iS^iO) (X( I ) . 1 = 1 . in) 
WRHE(5. 1860) 

II = 9 

00 iiHO IJ=1 ,8 

I=II-IJ ^ 
WRITE (5, 1890) Y ( I ) . ( I ARP A Y ( NFP . J , I ) . J= 1 . 1 0 ) 

CONTINUE 

GO TO '»020 , 
WRITE(5,'»160)(NXX(I),I = 1,3'4) • / 

F0RMAT(//1 X,3'*A2) ' \ 

WRITE (5, ill 80) . . . \ 

F0RMAT(///,6X,'RUN COMPLETEO. NEXT:') - 
, WRITE(5,'»200) 

FORMAT( 1X,'( 1 ) AGAIN (AUTO)? (2) AGA'IN (STEP BY STEP)? OR 

1 ' (3) QUIT. ( 1 ,2 OR 3)') 
REA0(5,260)LGN - 

IF(LGN-2) lf-240,'»2 70,'»320 . . 

. INP=. FALSE. 
IQ=.TRUE. 

GO TO 2:20 . 

INP=.TRUE. . ' 

IQ=. FALSE. 
IMP=.TRUE. 
MT='Y' 
GO TO 220 

IF(NSAVE0.NE.0)WRITE(5,'»3'«0)NSAVED,JNAMFIL(NU) , , 

1 NU=1 ,NSAVEd.) 

. ' - 307 -. 



' . ■ 232 

ERIC 



GS810 ; ^^^^ ' 

kikO F0RMAT(//1K.I2.- FILES ENTEREO^DURING RUN;^ 
\ /.'"SAVE" THOSE-^ BE KEPT: ,/,(3X,A6)) 

C — • • STOP / 

. EfjD • ... 




RiVLSIp _ : - ' .PAGE 10 

SUBROUTINE RLL8 10 (X.Y, TABLE, A.B.N, I ER) 

THIS PROGRAM COMPOTES LOG( V )=A+B(LOG)X EQUATION FOR DATA 
ENTERED IN 8(Y) BY 10(X) MATRIX. RETURNS A AND B IN LDG £ORM 
DIMENSION X(10).Y(8).TABLE(10.10) 't", 
IER=1 

SUMX=0.0 . . 

- SUMY=0.0 , 
SUMXY=0'.0 

• SUMX2=0.0 
SUMN=0':0 1 
NX=0 ^ 
NCOL=0 * 
DO 18 J=l . 10 
DO rs 1=1 .8 

IF(TABLE( I .J) .NE'.0.0)NC0L=:NC0L*1 . k 

SUMX = SUMX + ALOG10(X( J) )*TA8LE(I , J,) 
^UMY=SUMY+AL0G10(Y( I ) ) *TABLE ( I . J ) , 

SUMXY=SUMXY+UL0G10(XCJ) ^*ALOG10(Y( I )n*TABLE( I . J) 
SUM)<;2 = SUMX2+(ALOG10(X( jn '^'*7.)^TABLE( I , J)" 

• SUMN=SUMN + TA3LE( I . J) 
CONTINUE 

IF(NCOL.NF.ri)NX-fl^* 1 
NCOL=0 

CONTINUE ' ^ 
If (NX.LT.2)IER-0 " 
IF( IER.FO.n)r.n m 7n 
EN=SUMN 

S..I=EN*SUMX2-SUriX*SUMX 
S2=EN*S"f^xy "^nriv*!:!!,- / 

B^S2/S1 

A-^SUMY/EN- < ' '■ 

Nrif I"(E"> 
GO TO 10 
A=0.0 
8=0.0 

N=IF IXC^UMN) 
RETURN 
END ^ 




VLUlO ^'^^^ 1' 

SUBRQUTINE VLL810 (A.8,X,Y,TA8LE,VUNEi(P,VABT0T,R) 

t yn)-X(lO) AND COMPUTES THE VARIANCE DF THE 8(Y) BY 10(^2, lo^c 

C DATA FROM THIS CiNE OF REGRESSION; COMPUTES THE TOTAL VARIANCE ^ 

C ■ -OF THE TABLE DATA AND COMPUTES THE NON-LINEAR COEFFICIENT OF 
C . CORRELATION (=SQRT(1-UNEXPLAINED VARIANCE/TOTAL VARIANCE)). 

DIMENSION X(10),Y(8),TABLE(10, 10) ^ 

SUMN=0.0 "■ 

SUNEXP^-0.0 » 

SUMY=0.0 

SUMY2=0.0 . - . 

-.;d6 16 J= 1,10 4 . 

SUNEXP=SUNEXP+TABLE(I,J)*!{Y(I)-10.**(A+B*AL0G1O(X(J))))**2 . 

SUMY=SUMY+TABLE'(I ,J)*Y( I ) 

16 SUMN=SUMN+TABLE( I ,J) 

YBAR=SUMY/SUMN 

■ DO 17 J=l,10 • 

DO 17 1 = 1,8 . . « • 

17" SUMY2=SiJMY2+TABLE(I,J)*(Y(n-YBAR)**2 ^ 

VART0T=SUMY2/(SUMN-1.) 
VUNEXP=SUNEXP/(SUMN-1 . ) 

IF(VUNEXP'.GE.VARTOT)R=0.0 ... .^Vn-r ^ 

IF(VUNEXP.LT.VA«TOT)R=SQRT( 1 .-VUNEXP/VARTOT) 

RETURN 

END ■ " . ' . 




FTEST . ; . PAGE 12 

SUBROUTINE FTEST (NGMS .NBAS , F , P ) , 
' _ APPROXIMATES THE PROBABILITY OF CHANCE OCCURRENCE OF THE 

/observed value DF F, given NGMS (= N OF GROUP TESTED — BY , 
^ DEFINITION, THE GREATElR MEAN SQUARE) AND NBAS (= N Of THE 
BASE GROUP). 

REQUIRES SUBPROGRAM'S FUNCTION ANDXP. 

(REFERENCES NBS, ABRAMOWITZ & STEGUN.AMS 55 , 9TH;26. 6 . 1 5 ,P . 94 7 

V^l-NtMS- » , > • . . " 

4. . V^2=NBAS-1 ^ ■. « , ^ • 

|fe EXil ./3. ' 

F3=F**(EX) . 
' • . F23=F3'»*2 

Bl = 2./'(9.-*Vl ) ■ 9 ' - 

B2=2./(9.*V2) 

, Tl=l .-B2 ' . 

■ T2=I.-B1 

TDP=F3*T1-T2 

B0T=SQRT(B1 *F?3*B2) 

X-TOP/BOT . ■ ' I 

Q=ANDXP{X) » 

P=l.-Q 

RETURN 
rfc)0 



• 



'- 31V - * 



^ 



ANOXP 



FUNCTION ANDXP(X) • * , .. . ^ 

ACCUMULAtiyE NORMAL DISTRIBUTION FUNCTION. APPROXIMATES P • 

FROM X (TO + . OR. .- .0000001 ) (NBS-55, P, 932)- • 

XV = ABS(X) • ' ' , 

T = +.2316419*X1) - 

ANOXP = 1 .-.3989't23*EXP(-(Xl**2)/2. )*( .31938J5*T-.3565638*T**2 
j +1 .7814 78*T**3- J .8212 56*T**4 + 1 . 3302 7'4»T**$) 
IF (X.LT.0.0) ANDXP =~1.-ANDXP , 

RETURN • 's' .... 

ENR '' . ■ . . . » ■ ■]'■ 



312 - -.r 



2S7 



APPENDIX E-4 



r 



SAMPLE' OUTPUTS' 



5TAFFIN-G NEEDS Pl.ANNl.NG, CCTMPUTER PRGGRAMJ 



^1 



' . BUREAU OF POLICIES AND STANDARDS o' 
UNITED STATES CIVIL SERVICE COMMISSION 
' WASHINGTCN, D.C.. 20A15 



31^ . 




ERIC 



; ■ Page . 1 . 7 

THE FOLLOWING PAGES SHOW THE^ONPUTER ANALYSIS OF A GROUP OF CLERICAL 
aCCUPATtpNS -IN ,A SAWPLE'^AGENCY.* THE, OCCUPAT IONS INVOLVED AREj 



'«f-">;G^-3 22 
GS-312 
Ga-203 
GS-3t8 



ctERK-TYl^iST 
CLERK- STENOGRAPHER 
PERSONNEL CLERK 
SECRETARY 



THE ANALYSIS PLvAN USED IS AS FOLLOWS-* 

. " . ^ . 

STEP FILES liSED ' , ' 

;7— — — . . ^ ' * 

. ANALYSIS OF BASIC. FILES AND GENERATION OF 

NF322,MF322 ^ 
NF312,MF312 
NF203 ,MF203 
NF318,MF318 



» > 1 
2 
3 
k 
5 
6 



NF3 22,NF3 12,Nf"2J0 3,NF3 1 8 
MF3 22,MF312,MF203,MF318 

(TERMINATE FIRST RUN) 



ANALYSIS' OF SUMMARY FILES 



TOT 
NFC 



C 



T312irOT203rTOT318 

m 

(TERMINATE SECOND RUN) 



3 22,t\t 
L.MFCL^ 



OUTPUT FILE 
MARY FHTES ■ " 



TOT322 
T0T3I2 
TOT 203 
TbT3l8 
NFCL 



•rfCL 



315 - 




i 



xxxx' xvxVxxxi^xxxxxxxyxxxxvxxxxxxxxxxxxxxxxxxyxyxxxxxx 



WRI.TE OUT TEST QUESTIONS?, (Y 'OR N^) 



key: 
It 



r =• NO- TEST (N<3)^ 
= NOT SIG.' 




SUI^MARY OF TEST RESULTS 



■I;- 



DVANCEMENT CURVES. GRO (SKILt) DISTRI8. OVERALL -G/T PATIERN 

U8GRDUP P. DIFF.S SUETGRO^P P. DIFF.S SUBGROUP P^. D IFF.. S 

CURVE IS " ATTR CHCE SPRD IS ATTR CHCE PTTRN IS ATTR gHCE 

______ ..... - — — i^.- 



^ 7 * 



• . ^ ■' ■ 

WRitE OUT, DETAILED TEST DATATJ (Y OR N) 
Y 



SUN'MARY OF TOT372 STATISTICAL DATA 

GRADE/TIME EQUATION 



GP. FILE 
NQ. CODE 


GP. 

/OP, 




(A) 








(R) 


TOT32"? 




0, 


80391 


.0 


.0^1 37 


0 


.423,91. 


1 : NF322j 


209 




80396 


0 


.03P83 


0 


.41097. 


2: MF322 


2^1 . 






0 


.0AA08 


0 


.A'3393 



















VARIANCE from: 


GP. EQ. OWN EQ. 

v/--.— — 

(NA) 0.24109 

0. 23304 "^. 23290 
0.24941 0.24894 



NEXTJ. FILE ViPITF-OlIT : 
9 (NO~TILE WANTED) / 

- . cs • ■ ■ : .■ 

yyxvxyvyyyyyvxyyyYvvyxYXYyvxvv-yvvYXYxyyyyvxyxyxyxxx.yyvxyxYX'^vvyYvxx 



RLIN CONDLETFD. N£XT: 
(AUTp 



(1) AGAIN (AUTp)?,(2j AGAIN(STEP BY STEP)? OR ( 3 ) QUIT. (1.2 OR 3) 

- 316 - ' 



ERIC 



2.90 



xxxJifxxxxyxxxxxxxxxxxxxKxxxxxyyxxxxxxyxyx-xxfxx 



key: , . 

• / . * = NO TEST (N<3) • ./ , 

, n N _ NQj. SIC, • <J J 

4 SUMMARY OF TEST RESULTS 



ADVANCEMENT CURVES GRD ( SK I LL ) STR IB OVERALL G/T PATTERN / 

_ _ . — — 

FILE SUBGROUP P. .DIFF.S slfBGROUP P. DIFF.S, SUBGROUP P. DIFF.S/ 

CODE ■ CURVE Is ATTR CHCE SPRD IS ATTR CHeE, PTTRN IS ATTR CHCE i 



NF31.2 -> — — — 

MF312^ -.— — - . ---- . 



T0T312 Statistical data --^^ f- 



SUMMARY OF 1 

GJSS£/TIME EQUATION' ^AR-I ANGE-fVoM: 



GP/ FILE GP. ---'^SSS^- — — - — -J; 

NO. CODE POP. ' (A) (B) (R) GP. EQ. -TeWN EQ. 



TOT312 68 0.8'»'»57 0.06127 'CS^^lTSO (NA) ■ 0. 322-99 

i: NF31 2 . 52 0. 8'»357 «0. 05050 0.'»67i»5 0.317'»8 0.31391.' 

2' MF312 \ 16 ' O.S^^WS 0.09112 b. 66193 0. 3635^ 0.?1311 



■TV- 



NEXT FILE write-out: ^ ■ 

ENTER NO. OF FILE<1-P) 8R ' " 

9 (NO FILE WANTFD)_ 
9 

nqxx'yVxxxxyyxxxxxxxvxyxxyvyyvvyYyxyy^Yxyyyxyy^xxxxxxxxxxxxxyxyxxyxx 

RUN COMPLETED. NEXT: 
(1) AGAIN (AUTO)? (2) ACAIN (STFP BY STFP)?,0^(3) QUIT. (1,2 OR 3) 

H ■ . . - .. 



HOW MANY FILES DO YOU WISH TO ANALYZE? (ANS 1-8) 
.2- ■ . 

PLEASF ENTER. CODE NAME OF EACH FILE: , • 



291 



ERIC 



Page, h 



•xxxJ(^o(xxxyx){^yx;<xxxxxxxxxx)(xxxxxxxxxxxx ' 

' '■W' " - NO TEST '(NO) ■'■ • ■ .' • / . 

■mi--" - N'^T src.,' . , ■ . , ' , 



c'a.__. 



■'summary, of jest results-—- -t'--- 



'W . ■.. (^ym:Et^EHJ cgRVEs grd^ (skill,) |DIst.rib i^verall g/j j^'^te^^^^ 

• FiIe subgroup "7. DITF.S SUBGROUP P/OIFF.S. SUBGROUP '. B. DIFF'.S./;- 
jCloE CURVE PS ATTR CHCE SPRD I^S 0^^"- . .Arxr, rurc . 



|F20 3 

,.,MF20 3 



V 



TTR CACE' PTTRN IS ■■ A.TTR CHCE 




SUMMARY OF TOT203 . STATISTICAL "data 



I GP. FILE/' GP.' 

; NO. coor\ pop. 



GRADE/TIME EQUATION 
(A) (B) . 



(R) 



' VARIANCE from: 
,GP. EQ'". '/dWN EQ. . . 



TOT203 

1: NF203 
2: MF203 




463 
292 



0.83079 0.090'*8 O.^i^k U '(N/f) . 38950 ' 

0.831^5 0.086Aif 0.61.705 Q . i»0946 ^ />:0 . ^ 1 047 

0.82916 C. 09759 0.69686 0.35762.^,0.35151 



next: FILE WRITf-OUT: 

« 

ENTF^ NO. OF P'lLEC 1-8) 0^ 

9 (NO FILE WANTED) 

9 



1* 



7- 



•yxxxxyvx^xyyyxyyyyx/yxxxy^^yxyxyyyyxyxYyyyxxxxvx^^Vxxxyxxxxxxxxxxxxx 

RUN COMPLETED. NEXf: . 
(1) AGAIN (AUTO)? (?) AGAIN f^TFP BY STEP)? OR (3) QUIT.jn,2 OR. .3) 

HOW ^ ANY FILES DO Y(?U'WISH TO ANALYZE? (ANS 1-80 
2 ^ 

PLEASE ENTER CODE NAr'E 0^ EACH riLE^ > 
1 = 

- 318 - 



202 



ERIC 



ADVANCEMENT CURVfes ( SK I Ll D I STR I B • OVERALL G/T J PATTERN 

f I LE V SUBGROUP P. D I FfiN.S SUBGROUP ,P. DIFF. S SUBGROUP P. .DIFF.S 
edDE ViCURVE IS ATTR CHCE SPRD IS A-TTR CHCE PTTRN 'IS /ATTR GHCE 



7 



NlF\l 9 - 



SMALLER 



'J 



<=.05 



SUMMARY 0F\T0T3 18 STATISTICAL DATA 



GP. FILE 
NO. • JCODE 



.™T3 18 

i. 

]'•■ NF318 
2: MF318 



poil 



1 27 

90 

37 



,GRADE/TtME EQUATION 



(A) 



(B) 



(R) 



0.*7«813 0.05653 0.36052 

0.790'*3 0.06-323 0.3830i» * 

0.2,9'i( 



0.7,8'»60 . 0.035'»7 0.2,9'*02 



VARIAN&E FROMJ ' 

; 

GP. EQ. OWN £Q. 



d.6'»'»57 



0.7'»782 0.7*2768, 
0.^0722 0.36359 



next: FILE WRITE-OUT: 

■ ■ t 

ENTER NO. OF<!»FILE( 1 -a) 0^ 

9 «(N0 FILE WANTFO) 

1 



,NF3>8^ file: ^, 



,. .9. 7-5 . ^ 

8.75 
8.25 • 
.7.5b 
6 .♦50 . 

or 5.^5 



.50 


i..5;o,, 


'2 * 50 


3.50 


4,50^ 


5, 50 


6. 50 


7.50 


•> o' 


J 

0 


0 


0 


0 • 


0 


' 0 


0 


* 0 


b 




• 0 


0 


0 


• 1 


■• " -0 


0 


0 




0 


0 ■ 


0, 


. 0 


0 


0 


. 0 




^ .0 


' * 2 


1 


2 




•1 


0 


- 4 


. 2 


1 


V 


3 


1 




5 


7 


k 


. 6' 


^ 5" 


3 


0 . 


H 


y 6 


1 


2 


1 


2 


3 


0 


^3, 




0 




• 0 


1 


0 


0 



NEX«1J 



F ' IL b' W^ I'TE-OUT: 



ENTER NCv OF FILE (J 
9. C.NO FILE WANTED) 
? 



•8) QR 



>.FJ1'8 ; '.file: 

* V ..Q, 

GRO(WT) 
.9:75 • 
9.25 • - 
. 8..)5- 
:' 8.2 5-- 
7.50 . 
■' 6-. 50 : 
• 5.75 
5.25. 



50 

^ 0 
0 

0 

0 

/■o 

^2 

2 

O; 



'1 .50 , 

a 

0 

0 

0 

0 

1 

2 
0 



2.50 

0 



next: file WRITE-^OUT: . 

ENTER NO. OF FILE (1-8) OR 

9 (MO FIt;E W./\NTF[I)/ 

9. ■ ■ ; ;' > 



< ■ 



50 
0, 

0 

, . 1 
' 1 

0 



6 

9 \ 
1 

3 . 
0 



0 
0 
0 
1 
1 

,3 
1 

0 



0 

0 

0 

0* 

0 

0 

1 

0 



8 ;5d. • .9 



it. 50, 5. 50 6. 50 ,7. 50 8. 50 9 



0 
0 
0 
0 
0 
0 
0 
0 



..• • / ■ ;'. • ' ' . ■ : ' \.Page 1 ' ■ i^: 

xxxxyxvxxxxxxyyxxxxxxxvxxkxxy^'-'xxyyxxx^'xyyyxyyxxxxxxyvyxxxyxyxvxxxxxxx 



i-^.. ■ .H_= NO TE.ST (N<3) . ■ . 

■ = NOT si,g; ■ . ' 

U..-. "^SUMMARY- OF TEST RESOlTS A K '—- " 

^ ADVANCer^ENT CURVES ' (SR D. ( SK IlL ) DISTRIB OVERALL G/T- PATTERN i 

FILE SUBGROUP P. DIiFF.S' SUBCRO'UP. P. DIFF.S SUBGROUP ' P. DIFF.^'>A 
'CODE .CUR\/E IS ATTR CHCE SPRD IS ATTR CHCE PTTRN IS ATTR qHCE 



.1' 



NF32? 
NF-il ? 
Nf^.203 
Nr3 1 B 



LOWER 
HIGHER 



<=.00,1 



Sr^ALLER 
SMALLER 
SMALLER 
GREATER 



<.or^oi 
<= . 0 1 , 
<=.01 . 
<=.05 



DIFF. 6-^ <.006l 



^SUMMARf OF NPCL STATI-STICAL DATA 



G-P- FILE . 

"CptJE . 


A 

\ 

GP. 
POP. 




GRADE/TIME ^QUATICN 


. • VARIANCE FROM J' 




(A) 


(B) 


(R) ^ 


GP. EQ. 


OWN Etl. 


'nfcl 


6A3 


" 0. 


81 7.'' A" 


0. 07552 


■ 

0.55855 ' 


(NA) 


' 0.54952 


i: NF322 
?: NF?1? 
3: ' NF203 
k- NF318 


209 
52 

292 
90 


0. 
P. 
0. 

• 0. 


80 3*96 
8A357 
831 
790A3 


0.03''a3 
0.05050 
0.086')A 
0.06323 


0.AI097' 
0.'»67'»5' 
0.61 305 
0. 2830') . 


0. '3 7661 " 
0.48996 . 
0.56216' 
0.97009 


.0.23290 
^0.313^1 
0.41047 
0.72768 



V 



next: FILF W7I TF'Oint 



FNTEc? NO. OF FILF( 1 - ^) OP 

9 (NO FTLF WANTFP) 

9 



XXXy vyvyyxyyyvxyyyyyyxyyyyvyxy'^'y vvxX^X'^xyv/xyXvyy^o^yyyXXyX^ 

RUN COI^PLETFD. NF/XT: . , / " 

(1) AGAIN (AUTO)? (?)^AGAIN (STEP BY STEP)? OR (37 CULT. (1.2 OR 3) 



'V/ 



• page; 8 



.V.'/'.-.. ' < "■■= NO TE.ST.'(N<3) 
. H M = MOT siG. 



_• . StJMMARY OF TEST RESULTS : -j-^.^- — --r 

ADVANCEMENT CURVES GRD (SKILL) DJSTRI.B ' OVERALL G/T'PATTE'FiN" 

FILE SUBGROUP P. DI^'F.S SUBGROUP.. P. DIFF.S SUBGROUP P. OIFF^S 
^CODE CURVE IS ATTR CHCE - SPRD 1 S • ATTR CHC£ PTTRN IS ATTR CHCE 



MF.3 2? 
Mf 31 r 
MF20 3 
■MF318 



LPWER 

HIGHER 
LOWER 



<=.05. 

<=.b J 
<= . 0 J 



SMALLER 
SM'ALLER 



<.000 J 



IFh 



<.0001 



SUMMARY OF MFCL STAJ I ST I C^L ' DATA 



, :..JL.. 



GP. FILE 
NO. CODE 


GP*. 
POP. 




GRADE/TIME EQUATION 


VARIANCE* FRQM : ' , 





(A) 


(B) 


■ (R) 


GP. EQ. OWf!|,'t.Q. 


MFCL 


<»S5 




81271 


0.082 J'S 


• 

0.620*11 


) 

(NA) ■; C.'*8.1'7'» 


i: MF32 2 
2' MF312 
3: MF203 
k'- MF319 


731 
1 6 
171 

37 


0. 
' 0. 
, 0. 

n . 


80^4 04 
8'*'l'7S 
82916 


0.04'»0 8 
P .091 1 2 
0.09759 
O.Q35'»7 


0'v45393 
0.615193 
0.69686 ' 
0.29^*02 


^^^. 32841, /* 0.2he9h . 
Tl'.6i»3'»8 0.'31 ?1 t 
0.56163 \ 0.35r51» 
1.0568? \.0*3 635'9 

' \-J- : 



next: FILF .*wni TF.n"T ■ 

ENTE^? NO. FILF ( ' ^ 
9 (NO FTi r Vi/\NTr 



9 



yyyyvvyyvxyyvyvyyy^ ^ y v v v v y >' v >;v v v y y y v y y y y y vy vy y y y y y Y yy.y y yy y v y y Y y vy 

NFXl 
) Ar 



■ ~ RUN COMPLETSX). MFXT^ ( 
(1) AGAIN (AUT0)?^(7) AGAIN ("^Tf^ RY STEP)?. OR (3) QUIT. (1,2 OR 3) 



V 



322 - 



ERIC 



. 2u 



a 



: V CUMULATED ■ 0 1 STRI BUT ION SUM 
:;f . J ;SUM Qf J T0T322; TOT3t2 

■■9; ?s 

; ..6i50 , 
V ■ 5. 75 ■ 
V; 5 .25 



Page 9 



T-OT203 TdTjia 



d.so 


1 .50 


?.50, 


3.50 


.4.50* 


5.56?t( 




.0. 


0 


. V* 0 


0 


' ' 0' 






. • .a 


. .0 


.0 ' 


1 


: Z 






' .0 ■ 


0 


0 


• 1 




I* 


a 


1 


■ 3 


9 


13 


2** 


53 ' 


■ 2't . 




1»2 


51 




AS 




2 8'' 


n9 


128 


.109 


'19 


27 


23 


17 


. 82 


' 30 


10 


2 


1 


•••^ 5 




■ 10 




0 

rr 


0 


. 0' 




..'0 



.9.5^ 



xxxxYxvxxxyxyxyxyxxxxxxxYxyyxx'xyx^yxvxvxxx'''X"XXXXX'^yvy^^^^^^ 



key: 

tf 



" = ,N0 lEST (N<3) 
■-" NOT STG. 



7 // 



SUWAPY OF 'TEST RESULTS 



ADVANCEME.NT CURVES HRQ (SKIi.L) OlSTRIB 



FILE 
CODE 




0/ « 

s ■ 

7 
3: 
6 

A " 

0- 

0 



OVERALL G/T, PATTERN 



SUBGROUP P. OIFF.S SUBGROUP P. OIFF^S SU3GR0UP 
CURVE rS' ATTR CHCE SPRO IS ATTR CHCE PTTRN IS 



P. OIFF.S 
ATTR CHCE 



TOT372 
TbT312 
Tt)T20* 
TOT318 



LOWER 
OITF. 

HIlGHER 
■LOWER 



<.0001 

<=.05 

<.O0Ol 

<=.ot 



SMALLER 
SMALLER 
.SMALLfR 
GPfEjfl^TER 



<.0P01 
<=.01 

<=.noi 

<=..0 5 



■9- 



PIFF. 

\ 



<.0001 



WRIOTE OUT OETAILFD TEST^ DATA? (Y OR N) 
Y 

^iif^MARY OF SUM STATISTICAL OATA 



GP. 
NO- 



FILF 
crODF' 



GP. 

POP, 



SRADE/TIME EQUATICM 
(A) V .M) ,^R) 



VARIANCE from: 
■ 

GP. EQ. OWN EQ. 



SUM 

1 i - TOT322 
2: TOT312f 
3- TOT203' 
At-^ TOT318 * 
( 



] 09P 


0. 


8151^ 


0 


. 07P71 


0. 


5*^R72 ' 


(NA) 


0.521 73 


J 


0. 


80391 


.0 


.OAi 37 


0. 


'♦2391 . 


0. 3A990 


• C. 2'» 1 09 




0. 


8^i*S7 


0 


.0612 7 


0. 


1 80 


0.S2&iS'< 


0^32299 


463 


0. 


83D79 


0 


.090U8 


0. 


e'.Al 3 


0.56l)08 


0 .,38950 


127 • 


0. 


7?'81,3 


n 

y 




0. 


360';2 




0.6'»l»57 










^ - 




1 

/ 
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C OMUL A TH D P I S J R LB U T I ofj S UM 
iSUM'':OF:p T0T|22 TpT3t8 



i?age 10 



.0 



, GRD(WT)' 

•■■9:75>'/- 
9.2.5/-. 



/ .p. 50 1> 50. 2. 50! 




0 
.0 
0 
G 

15 
78 
78 

7 



0 
1 2 
9^ 
28 

2 



0 

0- ' 



0 
0 
0. 
2 
12 
76 
10 
0 



3 .60- k.SO 5.50 .6.50 7..S0 . 8. 5Q'.' ■9.5b-' 'C . 



0/ 
0 
0 

n 

.;i.3;.:' 

■1 3'. : 
, 2 ■ 
■ 0 



0 ■• 

0 . 

0 

2 

6' • 
21 ' 
1 

0 



/, 



0 
0 
0 

I 

17 
19 

■5 
1 



0 . - 
1. 

I* 

10 ■ 
1^ . 



6 

o' ■' 

0 V. ' 
0.. 

1 

1 ■ . 



5* t. 1 



0 . 

'Y 
0 

o„ 
,1 

'2 
3 
0 



0. 



/ 



) 



xxxxyyxxyxyyxxyxxxxxSofxxxyxxvvv^yxyxy^^^ 



\ 



"-,= NO TESt (N<3V 
--" - NOT SIG. 

SU^II^A^Y OF TFST RE.SULrS 

ADVANCEMENT CURVES 



FILE 

coiJe 



Gf^D (SKILL) DISTRIB . OVER/VLL'- G/T PATTFRN , . , . 

SUBGROUP ■ P. DIFF.S SUBGROUP P. DIFF.'S SUSGROUF ' P. DIFF. S «j 
CURVF IS ATTR CHCE .SPRD IS . 'ATTR CHCE PTTRN IS, A,T TR <:fclCE -^^ • 

_ _ ^-l- _^ ■ ; »* -i. -^-S"-'. 



I 



TOT3:2 , : 
TOT318 ^ 

■ • ^- - - - - — 



. GP. FILE 
NO^ COCF 



• SMALLER <=.001 ■ ' 

GPEATER . <..00Ol ' v DI FF ..•■..■'''<. OOOV' ■ . 

' su^^^ary of sum statistical data 
gpadf/ti^^e' equation ; 

' (A) \ (C) V \ (^) 



GP. 

-PGP 



GRv/E"6 *.,,>:.(>.WnVeQ-. • ; 



SUM 

i: TOT32.? 
?: TOT^IS 



9f 7 

I 

177 

.A.. 



H 0. 



80??S 0.0^07? 0. ^9879 



0.35365. 

0. 80391' 0.0^ 1 3 7 0 .^7391 ^ 0.?4270 'o. 2^109 




next: file. WPI TF-OUT - ^ 

ENTER NO. OF FILE( 1-8) tOR 
•9^ (NO FILE WANTED) 
1 



r 
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i 



-man riiE: 



■ 

.9,25 ' 
:8'..75 
8.25 
7.50 
. 6.50 
5.75;' 
5.25 



0.5.0 1 .50. 2.50 



1% ■ ' 
'? . 50 



.0 
0 

LyVk 

73 

. n 



G 
0 
0 
0 
1 2 

'20 
J 



'(. 



■ ,0 ' 

• 0.; 
;^'o ■. 

10 

.0 . 
. 0- 



0 


0 .• 


' ^ 0. 


' 0 M 


0 




0 . 




-0 




0. ^ 


0 




0. 


0. ; ! 


8, 


10 


V 




6^6 




' I'O 


■1.1 ; ; 




• 0 


0 


■■ ovl 


. 0 


0 


0 


■ ■ -Oi i 



?y5p 8'i-5^'.\9\S0- -: 

^■.,\.,S;,..v,v:.v.:. 

', o: . 

■iO; ■ 
■0 



■/,.'d..- 
■ ■■■•;:6: ■ 

■, 0 ■. 

■ 0 . 



0 
2 
I 

0.. 



0 / 

0 ■ 

b 

0 

.0 

1 ' 

0 ' 



NE^T: , FrLE'\ WRITE-OUT: 

ENTER NO. Of FILE( 1-8) OR 
9 (NO FILE WANTED) 
2 

TQT^31fl FILE.: 





0.50\ 1 


.50 


2. 50 


3.50 




■5.50 


"6.50 ^ 


7.50 ;'l 




» . 5,0' 


GRD(WT) 






















9.75 
9.25 ^ 


o\ 


■ 0 


0 


0 


< 0 


0 


0 


0 




' ■ & 


0 A 


0 


0 


•/ 0 


0 


0 


1 


0 * . 




,0 


8.,75 


0 \ 


0 


0 


* 0 


0 


d. 


0 


0 - 


■ ,i0.v' 


I 


8.25 


0 \ 


0 • 


2 


0 


2 


1 




0 


0, 


, 0 


- 7.50 


1 ^ 


I 0 




3 


2 


5 




1 


'■1 


' 3 


6.50 


5 


\ ^ 


10 


? 


1 1 


P 


6 ■ 


0 


0 


''3 


' 5.75 . 




\ ^ 


6 


? 


1 


5 




I 


, '2: 


0 


5.25 


i 


\ ^ 


0 


0 


Oi 


1 


0 


.■0 




0 


next: fI-LE 


WRITE- 


out: 



















'Vvyyvvyyvyvyv 



xvxxfyyyyxxxx^'xyxxxxx 



ENTER NO. OF FILE( ]\S) OR 
9 J:nD file WANTED) » 

RUN COMPLETED. 'NRXT: 



(1) /iGAIN (ySUTO)? (7) yn/yiN (STfi" 8Y STFP)7 OR ( 3 ) loU I T-. (1 . 2 OR 3) 



1. 

HOW .MANY FILES DO YflU WISH TO A^ALY2E7 (ANS 1-8) 
2 • 

PLEASE ENTER CQgJ^ NAMF C E/\CH FILE: 
NFCL 




/ 



2 = 
NFCL, 
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2h. 



'CWMULATEO OISTRI&UTION SUM 



GROtWTJ 

9v75 . 
. 9v 25 
• 8.. 75 • 

8. 55 

7.50 .. 

6.50 
, 5.75 

5.25 



.50 

t 


"l .50 


: 2.50 


3.50 




,5.sd 


6.5:0 


7.5b 


'8.-50 

"■ » ' 


> ■ 

■ 0 


\ 

0 


'o 


0 


•0" 


■ 0, 


0 


0 


■ 


i 


0 
0 


• 


1 

f 


2 


• 7 


7 

& 


.. 1 
2 


■. .)3 




■ 3 


* 9 


13 


2k 


53 


2,1 


6 


5 


^1 




51 






/♦^ 


. 28 


k 


6 


119. 


128 


109^ 


19 


27 


23 ■■. 


17 


2 


. 2 


82 ( 




10 


2 


,1 


S 




1 


3 


10 < 


1 '? 


0 ' 


-.0 


. 0 


1 


0 


0 


0 



9.50 

0 
3 
7 

■ -3; 
' 6 

0 

0. 



,xxxxyxvxyxyxyxxxxxxxxxxxYxyxyy^'x^yxyxxvxyyxy^'y^xvxyxvvyyYxyx^'^'xxyvyxyx\ 



KEYJ 



" . " ^ NO TEST (N<3) 
It » .= NOT SIG- 



. SUMMARY' OF TEST RE^ULT^ * ----A . 

ADVANCEMENT CURVES GR D '( SK I LL )' Dl STR I B OVER'ALL G/T PATTERN* 



j FILE SUBGROUP P. DI^^F.S SUBGROUP 
COQE CU^^VE IS ^ATTR CHCE SPRD IS 



P. Dir^.S SUBGROUP 
ATTR CHCE PTTiRN l"^ 



P. DIFF.S, 
ATTR. CHCE 



NFCL 
MFCL 



GP. FILE 
NQ. CODE 



. SUM • ' 

r: "Ijfcl 

2S MFCL 



SUMMARY OF SUM • STATISTICAL DATA 

GRADE/TIME EQUATION ' VARIANCE FROM: 



GP. 

.PQP. 



1098 



(A) 



(B) 



(R) 



0,fl)51^' n. 07871 0.S8B7? 

vO. 817^4 0,07-5? e. 558^5 
0.81771 n,0P?is 0.620'4l 



GP. EO. OWN EQ, 



(NA) 
0. '55048 



0.52173 

0,54952 
0.48174 



NEXT2 FILE WRITF^OUT: 

ENTE^ NO. OF FILE(l-R) OR 
9 (NO FILE WANTED) ^ 
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HIREST TECHNICAL ANALYSIS V 

"HIREST" combines several, different types pfj|l techniques to achieve^ its 
purpose - the* estimating of group losses an^ hires over a multi-year 



»riod. 



^1 



The f irst^ksection is concerned with the estimation of loss probabilities 
for the employees whose data are contained in the file input? ^ the Start 
of the prpgram. ftie methods used for this purpose are as follows: 



1. Subroutine "LOSS" computes the probab^ity of loss due . 

to causes other than death, disability or retirement by 
^ means of the group L-P emiation. 



The "L-P"equ§^ion referred to is the ^log-probability turnover curve 

^ • / ■ \ 

equation of the group being analyzed. For a detailed explan§tioi^^of such 
equat ions, se'fe the Technical Analysis for program LOGPRO. A^ indicated.,^ 
there, L-P equations are of the form 

q(Y) = A + B(LOG X) . . . 

\ 

LOSS determines from th^ input A ancT B values the probability of retention 
fr^m point to by computing the value of the equation fgji^oints 
X = T-, and X. = 1i , and dividing the value at T2 by the val% at Ti . 
Subtracting this quotient from 1 gives the net L-P loss prdbabTlity for 



*^he indiv 



idual Tor the Tj^ -.to - T^ time period. 



2. Subroutine DEATH determines, by reference to the employee's 



sex and age, the probability of ji^th during the year 
involved. ^ " . 



Values for this subprogram are/taken from mortality tables of, the USCSC 
Chief Act:uary. A semi-log/plot of table valu^ indicates that when the 

tabled probabilities are convertedX to logarithms tteir values are 

• ^ \ ■ ' / ■ ^ ' " 

approximately "linear funttions of employee age. Accordingly a -linear curve 

of form 



log^ Y = A + BX , 

was fitted to the tabled data and the resulting equation is used in the 
subroutine in plac^ of a more cumbersome lock-up table pwrogram.* 

3. Subroutine I>3^AB compute the probability of loss due to 

disability by means similar to those ^sed in DEATig^ abqve. 

In ^ese two applications, the -equation f Us are reliable to two decimal 

•places, which is sufficient precisiofTToi?^ use in small-to-moderate-piz^ 

employee groups expected for HIREST. 

• . . V 

4t Subroutine RETIRE determines the probability of 
Retirement loss. * 

In thi&^ subroutine, the normally more cumbersome look-up table technique 
is used because*^of the need for testing for several cat^ories of re- - 

tirement eligibility. • , t 

■ " * ."^ ■ ' ■ \ ' 

/• ■• . , ' 

The loss rates shown are directly applicable only to ||pegular employees 
of the U.S; Civil Sfervicel Use of HIREST . on other populations wj^ll 
require develppment^of a new or revised RETIRE subprdgraii giving rates 
appropriate for such other jurisdictions or retireSne^ti Systems. 
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5. 1' The sum of the Idss probabilities is subtracted from 1 to 
obtain the employee' a retention probability for the time 
period speRifted. 

There 'is oae exception to step 5: in the base period, the period from 
th^ ;as-of date of the employee file to the start of the first fiscal year 
of the projection periodTsj^e annual rates derived in stef>s 3^4 are first 
multiplied by BASE, tji^ length of th^ base period in years. 

4 In this we are adopting ^he standard actuarial assumption that such 
annual probabilities are approximately linear functions o^^ the time of 

^jBxposure. Thus, for example, if the on^year probability of a.partic^ar 
kind of lo^s for a person of given age and sex is .012, we assume th^at 
the probability of such loss over, say, only a five-month base period is 
5/l2ths of the 'Annual rate or .005. 



Such linear interpolation is not needed for step 1 because the L-P ^qUatipn 
technique computes the .probability for the exact base period. Such inter- 
pMa tion is also not needed in the projection period because the time 
intervals involved are all full years. ^ 

i • . 

The second major section of HIREST is concerned, with estimating the 
numbers and types of employees to be added to the starting workforce 
during each year of the projection period and estimating losses among these 
' new .hires during the projection periqd. ^ . • * ^ 

V ' ■ ' ■ ' ■ ■ ■ 

//This process is essentially on^ of iterations.. Given the number of ^ '> . 
employees in file at the start of the base period (i.e. , the as-b^f date , - 

of the employee data file), HIREST first est jjnates the nimiber of siicji^ 

' ■■■ ... 333 - ■ ^0^ ■ - '^ '.' 
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employees who can expect to.be lo8t"^during the base period and then subtracts 



the remaining employees froiA the total employment figure "specified for 
the start of the projection period to obtain the number of additional' 
empldyees whoswill have to be on board. / 

Th^ nu^i)%^^r of additional employees needed on board by the end of th^ base 
period, however^^s not quite the total number of new employees that needv^ 
to be hired during the base perlk)d. This is because ^§ome of the people 

^ . ^ ■ 

\hat will need to be hired during the base period can be expected to xeave. 
before^nd of the base period, * 

For. eacample,. suppose the base period its one year long. Assuming that 
new hirin^^^ evenly distributed over that year^ the new hires still on 
bo^rd at the end of the base period will have an average length of service • 
of 0.5 year. Suppose furthet that the jgroup L-P ^equation shows that only 
90% of new hires into the group are still flrft at the end of 0.5 years 
fiToni date of hire. Obviously, then, if we need to have 9 new ^employees 
on board at the end of the barse period, our total base period hiring will 
have to be 10, not 9. ^ ^ * 

Thus HIREST first estimates the number of additional employees needed on 
board at the end-of the period knd then, using a^ retention rate. equivalent t 
half the base period^ estimates the number of new' hires needed. For that 
fraction of total accessions who are expected to be new hires, the 
retention rate is computed *from the gr^up L-P equation. For that 
fraction who are expected to be accessions to the group from elsewhere in 
the Service, the retention rate is one-half the average^ group rate for 
the period Involved. 



With the number of additional end-of^y ear emijloyees '^f t 'f rom (a^)~^new hires 
(b) accessions estimated, HIREST then estimates losses in these two groups 
during ea(^h subsequent period olf the prO'ject.ion. The "new" hi|es" 
employee retention Over subsequent years is estimated from the L-P 
c^rve (for 0.5 to 1.5 years, 1.5 to^'2.5 years, etc.)* Retentl^on for 
"accessions'^' employees is assumed tf^^be the^^rame as for the retained 

\ 

original group population as a -whole. \ f ^ 

This process Is then repeated to eatinuite the number of new employees 
needed for each subaequeut year ut the projections. 



Several additjiouai polats should £be mnde concerning specific features of 
HIREST and the methodological a&^umt^Llouti that it reflects. 

First, HlKKbT aska tor lut>va o£ Lhc t>Lopoi.*Llon of total accessions who 
will be outside hires aud us^^ss Lhla figure for all projection years, without 
change because it la our experience Liiat the proportion of outsid^ hires 
to other accesalous' la moaL occupaiious Is the result of continuiiig staffing 
policy aud pract ice , .aad as ts^ch, tends to hold constant . over 'ti^e 
regardless of cbianges in total employment trend. 

Second, as indicated earlier, HIKEST assumes that loss rates among new 
accessions from elsewhere in the service will be on a par, for practical 

purposes, with the loss rate among continuing employees. In, some cases, 

f 

there may be some difference .in total rate to be nofed, depending on the 
sources from which in-service accessions are drawn, or in- the relative 

- ■ X ■ • 

distribution of losses by type (proportion of retirements, etc.). In 
most cases, we would expect these differences to be minor except (a) 



.'^ ■ . ^ ^ \ ^ / . Page 6 

where the ^|.st^ng workforce contains ^ strongly disproportionate share 
. ^of retirement eligililes or (b) where the\^portion of ,tota^ accessions 
who are new hirHs is . unusually low.. In auch cases, the absolute le^l 
of losses estimated can be/expectecj^ to vary , somewhat from actual experience 
The relative effects of changing projection parameters, hcA/ever, can be' 

' . ■ : — T- • , ■ . ■ . • , ■ ^ . 

/^'expefrted to^be reliable .(turnover goes up when employment rises, ett.) 
♦ ■ ■ ■ ■ ' . • ' 

Tkird, the RIF subs^ctioTn employs a simplified method of RiF-effect 
estimation which does nd^ differentially reduce the population i^"LaSt- 
Hired, First-Out't order, since RIF situations frequently vary in this 
respect. As a result, the loss ^ates estima^ted \>y HIREST may npt show^ 
as much of a drop in the first periud(s) following a major RIF as may ^ 

gfence is unlikely 

i-to be very iarge. Cotrectiun^for this apecial^ase, however, would ^ 
require additional program features and c<|fmpiexities which are beyond the 
scope of : a basic-~type prbgraui. 



And finally, H^RKST projections show only the^ojected ''expected 
values'' - i.e., Hl^JiSi:' doea not compute the prp^able range of relial^ility 
of expected valuea, as does the LOGPKO program. This is because . ^ 
HIREST iS' a hybrid program uaiag both probabilistic methods (L-P 
technique) and deterministic tectiaiques (the a^uarial- table subroutines) 
Since the probable error range of the actuarial components is not known, 
the probable ertor of the resulting overall projections cannot be 
readily determined. 



3 'J 8 
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^ • HlREST INSTR5 P(l) OT^O) 

*= CdMPUTERl WRITTEN . . ^ 



"HIREST" IS A FORTRAN IV PROGRAM ^FOR ESTIMATING LO.^S tATES AND 
HIRJNG NEEDS fN A SPECIFIC WORK^GROUP OVER A 1-5 YEAR ^^ERIOD. 
■ IT REQUIRES AN INPUT FILE OF DATA'ON EACH GROUP EMPLOYEE (DOB, 
EOD,SEX,SCD) AND -ENTRY €F '( 1 ) Tt^E NUMBER OF EMPLOYEES IN THE" 
,LJILE,'(2) the group log-probability .loss equation^, (3) J^HE^ 
EXPECTEOL GROUP POPULATfON DURING ^ACH YEAR O^THE PROJECTION 
•PERIOD, AND (^) THE PROPORTION, £IF TOTAL GROUP ACCESSIONS WHO 
^RE EXPECTED TO BE NEW HIRtS, ^ ^ ' 

REQUIRED SUBPROGRA^IS A^^t DATE, LOSS, DEATH . ^D I.S^B . RETIRE AND^ 
ANDXP. ♦ . ■ ^ 

EXECUTION COMMANDS, ^ ^ 

/ ' . 0 • 

COMMANDS FOR (A) TRANSLATING THE MAiN PROGRAM ANO/REQUIRED 
SUBPROGRAMS INTO MACH I NE.|CANGUAGE (I,E,, THE "COMPILATION" . 
PHASE) AND (B) LOADIt^^G THE COMP I L ED^PROGR AM AND SUBPROGRAMS 
--4NT0 THB CENTRAL PROCESSING AREA AND STARTING THE RUN (I.E., 
THE "LOADING ANB EXECUTION" PHASE) VARY ACCORDING TO THE , 
COMPUTER SYSTEM BEING USED. 

TiHE '^HlREST" RUN BEGINS; 

. * ENTER S- SPACE NAMt Bl- 
* EMPLOYEE DATA FILE 

THE OPERATOR RESPuNb::^ Ui LhiLKifa. iiiL cUUt NAME OF THE GROUP- 
'EMPLOYEE DATA FILE (E G., SAMPLE UATA FILE MP80l,0R .^P3 22-) . 
(NOTE THAT AIL DATA EnTRILS ARE hOLLOwED BY A CARRIAGE-. 
RETURN.) . ' , . 

VmP801 

MP32i? 



/ 



' *'ENlER NO. UP hrtrUJYth:- ' ' 

* IN (MP801 ) FILE 

ENTER THE NUMbLK IN iNlhuLK r uKr: (Nu OhClnAL HCJINI) ANU hOLLOW 

WI TH A CR. y 

NEXT [HE COMPUTER ASKS- . * 

* ENTER (MP801 ) AS-UF ( 1 . L . , 

- * CURRENT) DATE IN MO.. YR . (h.a..o5,/b) 

ENTER THE DATE OF IHE FILE C/iLlED PREVIOUSLY. USE INTEGER FORMAT^ • 
E,G, . MAY 1975 = "05.75".* 

- (NOTE THAT ALL DATES IN THiS PRUGRA^ AND ITS FILE'S ARE EXPRESSED IN 
THIS vlA'f. INDIVIDUAL DAYS ARE IGNORED.) 

• . ' - :J3'9 - , 



ERIC 



^ . " , ^ HIREST INSTR: pfl) 0F^(3i 

^ COMPUTER WRITTEN \ ' . • / . 

yEXT: . " ^ , ^ ^. . 



* ENTER A, -B OF (MP80O L-P, EQUATION I 

THIS REFERS TO THE GRCJtJP LOG-PROBABILIT/ LOS^ EQUATION (SEE DOCU- 
'mENTATION lyCnrtOMPUTER PROGfiAM "LOGPRO'T: SUCH , EQUAT I ONS ARE OF 

THE FCgRM F'(V)=A+B(LOG X). IT IS THE A AND B COEFF ICI tNTS' WHICH 
'.ARE NEEDED HERE. DO NdT FORGET TO "TNTER THE NEGATIVE SIGN FOR 

THtB- VALUE. SHOW DEC IKAL POINTS. E.G.: * 0 

• 9022, -K ^ ' . ' , • 

NEXT THE COMPUTER ASKS: ^ 

* ENTER NO. OF FISCAL YEARS .-^^ 

* PROJECTION WANTED "(1-5) ; 

\ ■ . 

ENTER THE NUMBER DESIRED IN INTEGER FOR<^; eVg., 

NEXT THE COMPUTER ASKS FOR INPUT OF THE OAJA ITEfjIS NEEDED TO MAKE 
, THE DESIRED PROJECTION^ >■ . ^ 

* ENTER WO., YEAR OF *3 TART -X. 

* OF FIffsT FISCAL YEAR (E .G. .'i^JO, 76) 

SELF vEXPITANATORY. OOfOBER 19lf6j= "lOWfi". ' ■ 

* ENTER (MP801) POBULATIOn/ 

*^AT START OFW'Y (/l976): ^ _ ^ , • 

ENTER POPULATION DATA IN I N fEGER ,PoImAT . THE COMPUTER CONTINUES: 

* ENTER (MP801) POPOtATIQN 



Vat end of fiscal _ 



* 



197^ 



ENTER DATA INDl'CATED. COMPUTER CONTINUES TO REQUEST &ND-OF-YEAR 
POPU.LATION-tJTTFA FOR EACH OF THE YEARS OF THE PROJECTION. 

NEXT THE C''OMPutER ASKS: " - 



/ 



* ENTER FRACTIONjOF (MP801) TOTAL ' , - 

* ACCESSIONS WHO ARE. NEW HIRES (E.G., 0.25) 

ENTER THE INDICATED DATA, BEING SURE TO .SHOW THE NECESSARY DECIMAL 
POINT. ' / ^ \^/ 

FROM THIS POINT ON THE PROGRAM RUNS AUTOMATICALLY , PERFORMING 
THE NECESSARY COMPUTATIONS AND PRINTING OUT THE RESULTS IN 
TABULAR fORM. THE COMPUTER THEN ASKS- 



'-ill 



J : 



• = COMPUTER WRITTlfN 
RUjJ AGAjy? (Y^ OR 




HIREST INSTRs P(3) OF (3) 



I, A 



IF NO FURTHER RUN IS NEEDED, ENTER "N". IF A "Y" ENTERED, ^HE 

COMPUTER Then asks: 



( 1 ) F.ROM TOP|> OR J.2 ) WI TH 
(J OR 2) - 



L ' 

_^NEW EMP.T 



DATA 



IF "r*^IS ENTERED' THE PROGRAM RECYCLES TO THE FIRST STEP. IF" ."2!* 
IS ENTER€D THE RECYCLE IS-TO THE QUESTION:. ~ . / , • : 



* ENTER NO. . OF FISCAL YEA"RS 

* PROJECTION WANTED (1-5) 
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* c- 
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40 
50 

"^0 



no 

■ 'v ■ 

HO 
*l 70 



210 



: \ 

HIR^ST 

MAIN PROGRAM HiiEST. INPUTS'" F|jLE-,OF Ei|lPLdYEE DATA 
(OOfi, EOb, SEX, SCQ). "REQUIRED fNTRY^OFNCROUP L-P*.^ 
, EQUATION, FUTgRr EMPLOYMENT LEVfU;^ l\-S\iE^Rt), ANO 
. PROPORTION of; TOTAL ACCESStONS WfHO 4^RE NEW .HIRES. 
PROJECTS LOSSES OUE TO L-P CURVE, OEAJH",- D.I SAB I L 1 1 Y 
ANO RETIREMENTS; JESTfMATES R I F . LOSSES ; £S;I; iMAsi:^ 
NEW HIRES ANO ACCESSIONS.. MoyiRf 0 'jS^USf^duT INES^~ 
. OATEi. loss; DEATX; [rfS'A9?,v' RETIRE , ANOXP. (3/75) ' 
- OIMENSION- POP( 7, 14) ,MPaEG^200, 1 3 ) . SUI»1L ( 6 ) . SUMO ( 6 ) 
1 SUM0I(6),SUMR(6)i,SUMSEP(6r.Hl^(6)vHIR(6), ... ^ 



I^AGE 1 



310 



340 

350 
360 



420 
430 




2 ACC(6) ,SUMH(6) ,PCT(6,.6) , 
\.WRITE(5,50) 
FORMAK//, IX, 'ENTER 5-SPACE NAME 
1 lX,"^EnPLOYEE DATA FILE",/) 
RE>VO(5,800NriLE . . ' ' 
F0RMAT(A5) f i« 

CALL IFlLE(20,Nh ILL) 
WRriTE(S, 1 10)NFlt£ 
FORMAK//, IX, ''ENTER NU 
1 IX, 'JN ",'A5," FIl E" 
READ(x5, iSONEMP 
FORMAT ( ) 
WRI TE( 5. 1 70)NFll t 
FORMAT(// , IX.'ENTlH ■ ,At 
1 1 X,'CURRENI ) OATL IN 
RtA0(5,*)MM,MYR 
WRITE(5,210)NFIl I 
FORMAT(//, lX,"ENThK 
REA0(5,^*)A,e 
00 2^5 L^l ,NEriP 

READ( 20,7^0) (riPHLL{ i . )) , ^ i . i ;J ; 
F0RMAT( 1 31 1 ) 
CONTINUE 

WRITE(5,260)NFll t 
FORMAT(//» lX,'FIlt .A!>, ' 
WRITE(5,280) 

FORMAT( JX,"En] EH NU. OF ^ 



RiF(6) , IYR(5) 



OF 



11) 



no 



A^ u^ 
. YR. 



At); I P tUUAT ION" ,/) 



HtAu. NhXf 



f SCAl 
(1-5) 



vEARS" 



^LA*< Oh S fAH | - ; 



1 1 X, "PROJECTION WANTED 
REAO(5,310)NYPRO 
FORMAT ( I 1 ) 
NT = NYPRO-»'l 
NP0P=NYPR0^2 
WRITE('5,3^0) 
FORMAT(/, IX, "ENTER nu. 
WRITE(5,350) 

';^ORMAT( IX.'OF FIRSr FlSCAl rtAK (E.G 
REA0(5,*)M0,NYR 
I YR( 1 )=190H-NYR 
IF(NYPRO.EQ. 1 ) GO TO ^30 
00 420 I=2,NYPR0 
J=I-1 

I YR(I)=IYR(J)>1 
CONTINUE * 
00 460 1=1.7 

- 345 - 



1, ■; 



/ 



3i4 



ERIC 



HIREST • r — p^^CE 2 

00 460 J=l , U 
POP(UJ)=0.0 

460 CONTINUE 

PQP( 1 ,U)=NEMP, , 

POPC 1 , 13)=NEMp " ^ . 

CALL DATE(MO,NYR,MM,MYR,BASE) . 
480 WRITE(5,'»90)NFILE, I YR( 1 ) / 

490: . FORMAK/./, 1 X, 'ENTER ',A5,s' POPULATION',,/, : / 

1 IX, 'AT START OF FY ' 
• READ( 5,520>1<IST , 

52t) FORMAT ( I ' , • 

' P0P(2, l'»>=NST 

■* WRITE(5,550>NFILE 
550 FORMAT(//, 1 X, 'ENTER ',A5,' P0PULAT lOfj' . / , 

1 IX, 'AT END OF FISCAL. YEAR') ■ ^ 

OD 630 1=1 ,NYPRO > , 

J=I+.2 ' • 

WRITE(5,6O0)IYR( g 
600 , FORMAT X.I 4, ':',/) 4; 

RE-AD( 5,620)NEND " ' ■ ' 

620 FO(^MAT( I'*) ■ 

'•630 POP(J, l'*)=NEND 

•640 WRITE(5,6'50)NFILE ^ 

650 FORMA,T(//, 1 X, 'ENTER FRACTION OF ',A5,' TOTAL',/, 

1, 1 X, 'ACCESSIONS WHO ARE NEW HIRES (£.0^,0. 25)s',/) ' j 

READ(5\680)PCTNU 
680 FDRMjftT(F5.3) . , 

DO .760 r=1 ,6 - ' 

•SUMLC I )«).0- . f ^ 

RIF(I)=0.0 .. ' 

SUMD( I ) = 0.0'- . ,, 

-^UMDI ( I ) = 0.0 I ' i ^ 

•SUMBi I ) = 0.0 

SUMSEPd )-0.0 J 
760 CONTINUE * t\ 

^ DO 780 1 = 1 .6 . 
780 . . , Hb( I )-0.0 



DO 1 1 70 1=1 ,NEMP 
MD0B=10*MPREC( I . 1 )+MPREC( I .'?) 
MD0B=10*MPREG( I , ^ )+MPREC( I ,'♦) 
"^vJCALL DATE(M0;NYR ,MDOB,NDOB.AGF ) 
HTm=10*MPREC('I.5)+MPREC(I .6) 
NTIM=10*MPREC( I ,7)+M.PREC(I .8) 
'CALL DATE(MM.MYR.MTIM.NTIM.T1 ) 
CALL DATE(MO,NYR ,MTIM,NTIM,T2) 
. MSCD=10*MPREC(I.10)+MPREC(I,1 1 ) 
NSCD=10*MPREC( I , 1 2 ) +MPREC ( I , 1 3 ) . 
CALL 'DATECMCNYP .MSCD.NSCO.SERV) 
SEX=MPREC( I ,9) 
■'\dQ.. I 170 J=l .NT 

CALL L0SS(A,B,T1 ,T2,PL) 
CALL DEATH(AGE,SEX,PD) ^ 
CALL DISAB(A*GE,SEX,PDI ) / 
; CALL RETIRE(AGE, SEX, SERV^) 

IF( (B/\SE.EQ.O.O) iOR.'(J.NE. 1 ) ) GD TO 1000 
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; rt)=PO*BASE . . ' 

P0l=f0I*8ASE ' 
••, PftiPR»BASE^ ■■■ 
^000 TdT=PL+PD+>DI+PR, 
/ IFdOT.GT. 1.) TOt=l. 

;R(j) = l.-TOT 
IF(J.EQ.I) GO TO W)60 

R(J)=R(J)*h(K) it - 

.RK=R(K) ' J- 

1060 L=a*l , ' 

If(J.E(t1)RK=J. 
POP(L.r5)=POP(L,13)+R(J) 
. SUML(J)=SUML(J')+PL*RK 
SUrt&( J )=SUMD ( J ) ♦PD*RK 

SUMDI(J)=SUMDI(J)+PDI*RK ^ 
IFXTOT.EQ. 1.)PR=1 .-(PL+PD+PDI) 
. SjUMR(J.)=SUMR(J)+PR*RK - 

T1=T2 ' ? 
T2=T1+1. ' ' 0 , \ 

AGE=AGE+1. • ■ 

SERV=SERV+1 . 
11 70 CONTINUE 

BMULT= 1 ^/ANDXP ( A+B*ALq(^l 0 ( BASE/2 . ) ) 
B I NP=POP ( 2 , 1 A ) -Pd^,(-2 , 1 3 ) 
IF(BINP) 1260, 1210,1210 
HIR( 1 )=BINP*PCTNU*BMULT 

ACC(1)=BINP*((1 .♦P0P(2,13)/P0P(1 ,13))/2.)*(-1 
P0P(2, 1 )=BINP*PCTNU ■ 
P0P(2,2)=BINP-P0P,(2, 1 ) 

HL(1)=(HIR(1 )-P0P(2,l ))+(ACC(l)-P0P(2,2)) 
GO TO 1270 . 
1260 RIF(1\)=ABS(BINP) * 

DO 1263 L0=2,NP0P . 
POP(LO,13)=POP(LO,l3J-RIF(1 )^ 

1263 CONTINUE 

SUML(l)=SUMt(l)+RIFCl) - 
1270 T1=BASE 

J=3 , 
Si DO y370 I = 3.NP0P 

^ T2yri+i. 

CALL L0SS^.B,T> .T7.Pl ) 
KtI-1 
L=J-2 
PL=1.-PL 

POPd ,J)=POP(K,L)*PL 
T1 = T2 

J=J+2 / , 

1370 CONTINUE ^. / 

J=A 

.DO 1450 I=3,NP0P- 
L-J-2 

' RATE=POP(J,13)/fV3P(K,13) 
>^ POP(I,J)=POP(K,L)*RATE 

. , ' . ■ .- 347 - 
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■' J=J+2 

1450 CONTINUE ?. 

BMULTil ./ANDXP(A+B*ALaG10(0.5)) s . . 

DO 1 796 iy=3,NP0P 
Tl=0.5 • . , 

LH=IY-1 
SUM=0,0 

DO 1530 LSUH=3, 13" . 
SUM=SUM+POP( I Y.LSUM) 
'*1530 CONTINUE 

. BINP=POP(IY, 14)-SUH ^ . . ' 

IF(BINP)1785, 1782,1550 V - ' 

1550 HIR(LH)=BINP*P,CTNU*BMULT V • 

ACC(LH)=BINP*(( 1 .+PDP(IY, 13)/PDP(LH, l3))/2. )*( 1 .^PCTNU) 
POP( I Y, 1 )=BINP*PCTNU ' ■ y 

P0P(IY,2) = BINP-P0P(IY, 1 ) 

HL.(LH)=(HIR(LH)-POP(I Y, 1 ) ) + (ACC(LH)-P0FN; 1^2) ) 
1F(I Y.EQ.NPOP) GO TO 1796 • — \ 
71=0.5 . \ 

IJ=IY+1 ' 
.j=3 • 

no 1 7 10 I=IJ,NPCP 
72=71+1. 
, CALL L0SS(A,B',71 ,72, PL) 
PR=1.-PL 

K=I-1 . * 
L = J-2 

?0P( I , J)=POP(K,L)*PR 
71 = 72 

J=J+2 % 
1710 C0N7INUE ^ 
J = 4 * 

DO 1780 I=IJ,NPCP 

K=I-1 . ' 

LrJ-2 

RA7E=P0P(I ,13)/POP(K,13) 
POPd ,J) = POP(K.I )*RATE 
J=J + 2 
1780 C0N7INUE 

GO 70 1796 

1 782 » hIR(LH)30.0 •»» 

ACC(LH)=0.0 

GO 70' 1 796 
1'785 RIF(LH) = ABS(8II/P) 
' - HIR(LH) = 0-/) 

acc(lh)=o.1d 

SUML(LH) = SUML(Lh)+RIF(LKi) 
DO 1 ;95 LO=I Y,NPOP 

BA=POP(I Y, 13>^ / 
. POP(L0,13)=f«€P(LO,'i3)-RIF(Lh)*(PpP(LO',13J/BA) ^ 

1 795 CONII-NUE ^ ► V- 



1796 C0N7INUF 
/ 
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00 2170 1^1. NT . 

SUMSEP.( I )=SUML( I )+SUMO(I )+SUHOI (I )+SUMR(iy 
IF(I»£Q. I ) GO TO 2110 ^ 
SUML(I)=SUML(I)/SUMSEP(I) * 
SUMDd )=SUMD( I )./SOMSEP( I ) 
SUMDI ( I)=5UMDI ( I )/SUMSEP( I ) 
SUMR( I )=SUMR( I )/SUMSEP ( I ) 
.J=I+l 
SUM=0.0 

DO 1910 1^2^,2 
M-L+2 

SUM=SUM+(POP( I ,L)-POP( J,M)) 
CONTINUE 

SUW=SUM+(POP(I,13)-POP(J,13)) 
SUM=SUM+(AXC( I )-PDP(J,2) ) 
SUML(r)=SUML(I)*SUM 
SUMDd )=SUMD( I )*SUM 
SUMDI(I)=SUMDI(f)*SUM 
SUMRd ) = SUM^( I )*SUM 

.SUMM=0J3- — s. ^ , 

00 202^ L=l ,9,2 ^ ^ 

M=L+2l 

SUMM=SUMM+(PDP( I ,L)-POP( J,M) ) 
CONTINUE 

SUMLd )=SUML(I )+SUMM+(HIR( I)-PDP(J,1 )) 

SUMSEP( I )=SUML( I )+SUMD( I )+SUMDI ( I )+SUMR( I ) 

EN=PDP( I , 1 4) ' 

PtT( I , 1 )=(SDML( I )/EN)*100. ' H 

PCT( I ,2)=(SUMD( I )/EN)*100. ♦ I 

PCT(.I ,3) = (SUMDI ( I )/EN)*100. 

PCTd ,4)=(SUMR( I )/EN)*100. • ■ 

PCT( I ,5)=(SUMSFP(I )/EN)*l 00. 

CONTINUE 

DO 2200 I=1^NT 

SUMH(I)=WIR7l)^Arr(I) 

CONTINUE 

WRITE( 5-2220) , 

FDRMAT(Y//, 17X, 'SUMMARY OF ESTIMATFO I DS?ES AND GAINS') 
WRITE(5,2240) (I YR(I),I-1 ,NYPRD) 

FORMAT (//,lX,'PFn I nn:''.(^y,'(B'/\<^F) " , I , i» f 6X , U ) ) 

WRI TEf 5,2260) 

FDRMAT(/, IX, 'POPULATION:') 

WRI TE( 5,2280) (POP( I , U) , 1 = 1 ,NT) 

FORMAT( /,3Xv,, 'START', IX, 6(.6X,F4.0)) ^ 
WRITE(5,2300) (POP(I , 14) , I=2,NP0P) 
F0RMAT(3X,'EN0 ' . 6 ( 6X , f . 0 ) ) 
WRI TE(5,2320) 

FORMAT(/, 1 X,'EST. LOSSES', 6(' NO. % '))' 
WRITE(5,2340) ((SUML( I ),PCT(I , 1 )), 1=1 ,NT) 
.F0RMAT.(/,3X,'LOSS ♦ ' , 6 ( F5 . 0 , F5 . 1 ) )• 
WRITE(5,2360)((SUM0(I),PCT(I,2)),I=1,NT) 
FORMAT (3X, 'DEATH ' , 6 ( F 5 . 0 ,F5 . 1 ) ) 
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WRIfE(5,2380)((SUMDI(I),PCT(I,3).),I=l,NT) - • 

2380 FORMAT (3X,'0I.SAB. ',6(F5.0,F5. 1 ) ) 

WRITE(5,2'»00)((SUMR(I) iPCT(I,'»)), 1=4 .NT) 
2400 F0R«AT(3X, 'RETIRE ',6(F5.0,F5. 1 ) ) 

WRITE(5,2420)((SUMSEP(I).,PCT(I,5)),I=1,NT> 
,2420 F0Rf1AT(/, IX, 'TOTAL ^,6(F5sO,F5. 1 )) . 

^ . WRITE(5,2'»32)(RIF(I),I=1,NT) 
2432 FqtlMAT(/,3X,'*(RIF) -,6(F5.0,5X)) / 
WRITE(5,24<»D) i 
.2440 FORMAT (/, IX, 'EST. GAINS',/) \ 

WRITE(5,246C)(HIR(I),I=1I,NT) 
2469, FORMAT (3X, 'NEW HIRES ', 6\F 5.0, 5X)) 

WRITE(5,2480)(ACC(I),I = 1,>HU ^ 
.2480- FORMAT(3X,'ACCES^ONS',6(F5.0,5X)) . * 

WRITE(5,2500)(SUMH(I),I=1 ,NT) \ 
2500 FORMAT (/, IX, 'TOT. GAINS ' ,6( F5 . 0 , 5X ) ) \ 

WRITE(5,2520) " ' 
2520 FORMAT (///, IX, 'RUN AGAIN? (Y OR.N)'/fif 

REA0(5,2540)IX ^ ^ 
2540 F0RMAT(A1 ) ' • • 

f IFMX.EQ.'N') GO TO 2620 

WRITE(5,2570) 

2570 F0RMAT(/,1X,'(1) FROM TOP; OR (2) WITH NEW EMP.T DATA',/ 

I i1X,'(l OR 2)',/) 

.REAb"(5,2590)IZ , 
2590 FORMAT(Il) 

IFdZ.EQ.l ) GO TO 40 

GO TO 270 --^ 
2620 ^STOP 
. . END . 1 • 
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P 



5UBR0UT INE DATE (MO I .MY I ,MOF ,MY.F , DAT ) 
NY=MYI-MYF 



Nrt=Mai-MOF 

EN=NM 

£N=EN/t2. . 

ENY=N.Y 

^ . DAT=E'N Y+EN 

M RETURN 

'-^i END 
# • .. ' 



f. • . ^ ' 



1 
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SUBROUTINE L0SS(A,B,T1 ,T2, PR) 
Pf l=ANDXP(A+B*AL0G10(TV)) " 
. ..P.I2=AN0XPC A+B*AL0G1 0 ( T2 ) ) 
PR=1.-Pf2/PT1 
RETURN . 



■ DEATH ■ . /PAGE 

SUBROUTINE DEATH(AGE, SEX, PD) 

IF(SEX;EQ.K)GO TO 50 

IF(AGE.GT.60. )GC TO '♦O ' 

PD=10.**(-'».07789+.03503*AG£) 

GO -TO 60 • ■ 
<»0 PD=10.**(-'».'»43573+.040977*AG£) ■ 

GO TO 60 " ' . 

50 IF(AGE.GT.60.)G0 TO 55 

PD=l,0.*.*(-3.98803 + .02685*AGEl 

GO T'O 60 

55 PD=10.**(-5.'»90235+.05n3*AGE)J 
60 RETDTfN >- ^ 

END 



V 
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SUBROUTINE DISA8( AGE, SEX. POD 

^ IF (SEX. EQ J. I.) GO TO 50 ^ 
. POI=10.**(-t0.25073+'».-87154*ALOGIO(AGE)) 
GO TO 60 

50 PDI=?10.**(-9.05.390M.19356*ALOG10(AGE)) ' 

60 RETURN 
END 



i 
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RETIRE • . PAGE 11 

/■BiJBROUTINE RETIRE(AGE,SEX,SERV,PR'ET) 
DIMENSION RML(13),RFL(13),RMM(16),rJ^M(16) 
DATA (RM(I),I = l,I3)/.06,.b5,.p6,.n,.09,.09,.W.l,.n, ' 

1 .U,il3,.I'», I./,(RMM(I),I = 1,16)/.25,.16,.15,.15,.16, S 

2 . i2r,.19,.18,.l8,. 19,.23,.22,.2^,.23,'.62,>./,(RFL(I), ^ 
.. 3 I = l, 13)/.08,.06,.d7,.12,.l,.l,.13,.l'»,.15,.l6,.l7, 

. >* .I8,I./,(RFM(I),I = T,16)/.29,.2,.18«.18,.19,.26,.2, 
5 ^22,. 19,.18,.23,.21,.2,.22,.61 ,1./' ' 
IF((<AGE.GE.55. ).AND. (AGE.1T.60. )).AND. (SERV.GE.30.) ) 

, 1 • GO TO 1 50 
IF(( (AGE.EQ.60. ).0R. (AGE.EQ.6l . ) ) .AND. (SERV.GE. 1 5 . ) ) 
r GO TO 200 ■ - k. » 

IF((AGE. GE. 62. ). AND. (SERV.GE. 5.*) )G0 TO 250 
PRET=0.0 

GO TO 340 . ■ 

150 YR=AGE-5'». 
I YR=YR . 

IF(SEX.*EQ. 2. )PRET=RMM(I YR) 
IF(SEX.EQ, 1. )PRET=RFM(IYR) 
GO TO 3A0 
200 YR=AGE-5'».. 
IYR=YR 

IF(SEX.EQ.2.)PRET=RMM( I YR) 
IF(SEX.EQ. l.)PRET=RFM(IYR) 
GP TO 3^0 
250 IF(SERV.GE.12.)YR=AGE-5'». 

IF(SERV.LT. 12. )YR=AGE-6'l. 
IYR=YR 

'IFCSEX.EQ. 1. ) GO TO 320 % 
IF( SERV.GE. 12. )PRET=RMM(IYR) 
IFCSERV.LT. 12. )PRET=RML(IYR) 
GO TO 3'»0 

320 IFCSERV.GE. 12. )PRET=RFM( lYR) 

IF(SERV.I. T , I 7, )rnFrT "f i (I vr^ ) 

3'»o 'return 
PNn 
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FUNCTION ANDXP(X) 

ACCUMULATIVE NORMAL DISTRIBUTION FUNCTION. APPROXIMATES P > 
FROM X (TO + OR - .0000001). (NBSrSS, P". 932) 
XI = ABS(X) ' .' ■ 

T- = 1 .7( 1.. + .2316419*X1 ) . 

ANOXP = 1.-.3 999423*£XP(-(X.1**2)/2.)*(.31 93815*T-.3565638*T**2 

1 +1 .781478*1**3-1 .821256*T*tf+l .33027^*T**5) 

IF (X.LT.0.0) ANOXP = l.-ANDXf* 

RETURN 

END ' 
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RUNNING THE COMPUTER PROGRAMS^ 



2 programs ^descrt^ed In this haAdboofe are all written in pbllT^lAN IV 
d ^re designed tp be on a tine ^baring computer system. 'As much 
< as possible these programs are written in standard straightforwkrd 
FORTRAN which should adapt to any time-sharing syfltdi. Respite \this, " ! 
however, there will prbbably be system differences which will require'*a/ 
user to mak^f^ome editorial changes' in the programs, y- \^ 

■ ' . . ' ''. w ' . ^ 

-There are two points in theprocess of entering and running d prpgram 
, into the conjputer wher4 such differences will become known. T^ie*^ following 
steps give a generaf outline of the process and show \^ere system" ^ 
differences may pop up. ' ^ 



1. 



2. 



3 ' 



Enter the program into the computer line by line (from a' 
Program Listing) . > 4 ' ^ ^ ^ 

Store the program under d^ts given name (using a command such 
as "SAVE") . . \ - 

Compile the program. The command for this will vary from 
system to system' and will, in many cases, cause the computer 
to print out, compilation error messages which will be ^he 
resjult of system differences. 



p compilation error messages occur, use the system's editing. 
Procedures to change the program lines causing the messages. 
Repeat steps 3 and 4 until there are no n^oife compilation 
messages. # 

5. Run the , program,. Here again, commands fo^ this step var^ 

from system tb system and what are known as execution eri^rvf^;^ 
messages may^ccur. These messages mean that, further editing ' 
of the pro grfilro^ will be needed. 

6« / Upon the elimination of the executljpn error messaged, the ! 

program should run and give the desired .analyisis results. , 

Some of' the areas more likely to need changing^las sr result of error ' 
messages are: * 

1. Input/Output unit nxjmbers., ^ 

2. Ti^B process of calling up data files.*' 

3» The syniDols used for continuation and coipmept lines. 
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^ 4; Control characters in FORMAT statements. 

5. Line format. ^ - • 

' Use of specification stat^ents. 

Al^i main programs and subroutines ^re entered and ^tored^ separately^ , At 
run time the method of calling up a set of mSin ^rogf aia pits subroutines 
vill depend on the rules of the it tme- sharing ^stem bfetng used. 

' ' * . • ' • ' . 

When running, each main^ program will ask to be given data. The fonn 
of the' responses to these quiestipns i^ discussed both in t^e chapter* 
^pertaining to ttie program and in each program's Operation Manual . .'\ - a 

Althot^ the yirograms are written for a ttme-sharlng environment^ • they 
may be adapted for use In batch systems. 

If futther* assistance is needed either in the running ojc thes^. programs 
or in revising the programs to ada4>t tq a particular use, , -contact the 
addtess int^e Preface. , ' ^ 
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